Preface


There are some numbers that, throughout history, have become enormously
famous for diverse reasons. In some cases, their fame is simply due to historical
coincidences. Sometimes the number in question appears at the centre of some
very relevant scientific finding, as was the case with the square root of 2, linked to
the discovery of irrational numbers, and also the number π, linked, among other
things, to the famous problem of squaring the circle.

Other numbers, on the other hand, have become widely known because they
are involved in very useful and interesting formulae. This is another reason for π’s
reputation, since it is essential, for example, for calculating the circumference or the
area of a circle. However, there can also be aesthetic reasons, such as when a certain
number appears in formulae or theorems that are elegant or surprising. This accounts
for the fame of the so-called golden ratio, which is usually indicated by the Greek
letter φ (read as phi), a number that allows you to design rectangles with proportions
that are pleasing to the eye. For this reason, it appears in numerous architectural and
graphic designs.

In this book, we will discuss a number no less famous than those just mentioned.
This is e, the number that, in 1731, the Swiss mathematician Leonhard
Euler decided to denote by this letter, although, as we shall see, it had been known for
almost two centuries before Euler named it.

And why is it so famous? Perhaps because it is involved in incredibly useful
formulae, or for certain historical or aesthetic reasons? In the case of e, all three
answers are true. In fact, the number e forms part, as we will see later, of very useful
formulae: for example, those describing the growth in capital due to compound
interest, the growth of bacterial populations or the decay of radioactive substances.
Furthermore, it appears in formulae essential for the calculation of probabilities and
plays a central role in differential calculus, one of the branches of mathematics
with the most practical applications.

Nor can its historical significance be denied because, as we shall discover, the 
number e is strongly associated with both the discovery of logarithms and the 
problem of squaring the circle. 

And as for the aesthetic aspects... Did you know that the number e is part of what
is considered the ‘most beautiful formula in the world’? In 1988, Euler’s famous
formula, $e^{i\pi} + 1 = 0$, was chosen by readers of the Mathematical Intelligencer journal as
the ‘most beautiful [mathematical formula] in history’.




In short, this work certainly speaks of a number. But it is not merely a
number, because there is so much to say about it and so much to explain, and this is
what we will do from the first page of this book. We’ll explain the ins and outs
of e in the most enjoyable way possible. For as the astronomer Tycho Brahe said
in 1596, ‘Mathematical truth prefers simple words, since the language of truth is
itself simple’.


Chapter 1 


Discovery of the number e 


What does it mean to discover a number? Was the number hidden in the hope that its
existence might somehow be recognised? The answer is ‘yes’. Although today we know
that the number e is essential for the mathematical description of many phenomena,
whether physical, economic, biological or other, that central role took millennia to be
noticed. And we speak of millennia because, unlike π, which was already known by the
ancient Egyptians, the Sumerians, the Chinese and other cultures that existed centuries
before our time, the number e entered the history of mathematics relatively recently,
around the start of the 17th century. This is very likely because, whilst π has a very simple
geometric definition (it is the ratio of a circle’s circumference to its diameter), the
number e, as we will see below, has a more complex definition. Indeed, so much so
that there were several ‘partial sightings’ of e before it was finally found.

Here we are going to discuss the exact definition of the number e and how its
first appearances in the history of mathematics were closely linked to the history of
logarithms. Let’s start with them...


Logarithms, a great idea to simplify calculations 


The 16th and 17th centuries were, for Europe, a time of great economic expansion.
The conquest of America and the subsequent exploitation of its natural wealth,
along with that of resources from Asian and African colonies, determined the end of
the Middle Ages and the rise of a new social class, the bourgeoisie. The medieval
feudal society was gradually replaced by a mercantile society in which the accumulation
and circulation of money played a central role.

In this same period, mathematics, astronomy and physics, sciences that had been
virtually stagnant in Europe since the 4th century, flourished and underwent an
unstoppable process of development. Among the great scientists of the time were,
to name but a few, Nicolaus Copernicus (1473-1543), Michael Stifel (1487-1567),
Galileo Galilei (1564-1642) and Johannes Kepler (1571-1630).

These scientific and economic changes drove the need for ever more complex
calculations, whether these were to compute the planetary orbits, establish
the position of a ship on the high seas or quantify profits from the distribution
and sale of goods brought from overseas. For this reason, various
methods were developed to perform difficult calculations quickly and accurately.
There were also inventions, such as Pascal’s Calculator (the Pascaline), devised
in 1642 by the mathematician and philosopher Blaise Pascal (1623-1662). This
machine, based on a system of wheels and gears, quickly added and subtracted
numbers of several digits. It is considered the precursor to modern electronic
calculators.

However, of all the systems used for making difficult calculations, undoubtedly
the most important and durable was the logarithmic method, developed independently
by the Swiss Jobst Bürgi and the Scotsman John Napier between the end of the
16th century and the middle of the 17th century. This method was used for more
than 350 years, from when Napier made it public in 1614 up until the widespread
use of electronic calculators in the 1970s. An anecdote: Jobst Bürgi was the first
person to conceive the idea of logarithms, in 1586, but he did not make it public
until 1620. Napier, on the other hand, published his findings in 1614 and is
therefore considered by many historians to be the discoverer of those numbers so key to
simplifying calculations.

It is interesting to note that the way in which Napier and Bürgi set out their
logarithms is different to how they are defined today. Presently, logarithms are
associated with the notion of the exponent. According to the modern explanation
of logarithms, in the equation $2^3 = 8$, the number 2 is the base of the power
and 3 is the exponent. Now, in this same equation, ‘3 is the logarithm to base
2 of 8’, a phrase that, in mathematical notation, is written as $\log_2(8) = 3$,
because 3 is the exponent to which 2 must be raised to make 8. Generalising this
example, the definition tells us that the base 2 logarithm of a number A (which must
necessarily be positive) is the exponent to which the number 2 must be raised
so the result is A:

$\log_2(A) = B$ if $2^B = A$
JOHN NAPIER (1550-1617) AND THE ORIGIN OF LOGARITHMS

John Napier was born in Merchiston Castle, Scotland, in 1550. The Napier family, one of the most
influential in the kingdom, owned the castle and had ruled the surrounding region since 1430.
From a very young age, Napier showed a great flair for mathematics and that is why, in 1563, at
the age of 13, he enrolled at the University of St Andrews to read this science. However, soon
afterwards, perhaps as a consequence of his mother’s death, he became more interested in the study of
theology. In fact, from then on, he always considered that theology was his main field of work,
whereas mathematics was only a secondary hobby.

Despite this, Napier is remembered, above all, for his work with logarithms.
The usefulness of the logarithm method for rapidly performing complex calculations greatly
facilitated the work of scientists. As Pierre Simon Laplace (1749-1827), one of his great
admirers, wrote: “By shortening his work, he doubled the lives of astronomers”. John Napier
died in Edinburgh, Scotland, on 4 April 1617.


In this respect, $\log_2(16) = 4$ because $2^4 = 16$ and $\log_2(1) = 0$ because $2^0 = 1$.

The number 2 is, in these examples, the base of the logarithm. But this base can
actually be any other positive number (other than 1). So, for example, as $3^2 = 9$, then
$\log_3(9) = 2$, and $\log_4(64) = 3$ because $4^3 = 64$.
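The modern definition can be checked numerically. The following is a minimal sketch in Python; the helper name `log_base` is an illustrative choice, and the results are floating-point values, so they are printed rounded.

```python
import math

def log_base(base, value):
    """Return the exponent x such that base ** x equals value (up to rounding)."""
    return math.log(value, base)

# The examples from the text: (base, number whose logarithm we want).
for base, value in [(2, 8), (2, 16), (2, 1), (3, 9), (4, 64)]:
    print(f"log_{base}({value}) = {log_base(base, value):.4f}")
```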

However, logarithms, as conceived by Napier and Bürgi, were defined in a
different way. For these mathematicians, logarithms arose from the comparison
of two sequences of numbers: an arithmetic sequence and a geometric sequence. An
arithmetic sequence is one that begins with any number, for example 0, to which a
fixed quantity is repeatedly added. For example, if we start at 0 and add 1 each time, we obtain
the sequence 0, 1, 2, 3, 4, 5, ...

In a geometric sequence, we also start with any number. But, in this case, we
multiply it by a fixed value, which is called the ratio of the sequence. For example, if
we start with 1 and multiply each time by 2 we get 1, 2, 4, 8, 16, ..., as shown
below:

0    1    2    3    4    5    6    7    8    9    10
1    2    4    8    16   32   64   128  256  512  1024


Top, an arithmetic sequence (logarithms); bottom, a geometric sequence (antilogarithms). 


Napier called logarithms the numbers of the first series (the word comes from 
the Greek words logos and arithmos, which respectively mean ratio and number), and 
antilogarithms the numbers of the second series. 

We note that the second sequence is formed by the numbers $2^0, 2^1, 2^2, 2^3, \ldots$,
so according to the current definition of logarithms, the numbers in the upper
row are exactly the base 2 logarithms of the numbers in the bottom row. In effect,
$\log_2(1) = 0$, $\log_2(2) = 1$, $\log_2(4) = 2$ and so on. Translated into modern language, we
can say that the above sequences show us a table of base 2 logarithms. Similarly, if in
the lower row, we write the sequence 1, 3, 9, 27, 81, ..., we would have a table of
base 3 logarithms; if, in the lower row, the sequence was 1, 4, 16, 64, 256, ..., we
would have a table of base 4 logarithms; and so on.

Now, how do these sequences help to carry out difficult calculations? Although
logarithms allow a great variety of calculations to be performed, for our purposes
it will suffice to show a simple example of how logarithms can help to find
the square root of a number. Let us begin by observing that addition or subtraction
of two numbers of the upper sequence corresponds to multiplication or division
of the respective numbers in the lower sequence. For example, the calculation
$2 + 5 = 7$ in the upper row corresponds to the calculation $4 \cdot 32 = 128$ in the lower
one, as we can see in the following figure:

[Figure: the arithmetic sequence (upper row) and the geometric sequence (lower row).]

A sum in the upper row corresponds to a product in the lower row.
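As a rough illustration of this correspondence, the sketch below multiplies two entries of the lower row by adding their positions in the upper row. The names `antilogs` and `multiply_with_table` are assumptions made for this example, and it only works for numbers that actually appear in the table.

```python
# Lower row: powers of 2. The index of each entry is its logarithm (upper row).
antilogs = [2 ** n for n in range(11)]

def multiply_with_table(x, y):
    """Multiply two table entries by adding their logarithms."""
    log_sum = antilogs.index(x) + antilogs.index(y)
    return antilogs[log_sum]

print(multiply_with_table(4, 32))  # 2 + 5 = 7, and the entry at position 7 is 128
```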


Similarly, if we calculate double a number in the upper row (that is, if we add
that number to itself), this corresponds, in the lower row, to squaring (that is, it
corresponds to multiplying the number in the lower row by itself). With the same idea,
calculating half of a number in the upper row is equivalent to calculating the square
root of that number in the lower row, as shown in the figure:

[Figure: halving in the upper row corresponds to taking the square root in the lower row.]

Half of 8 is 4 and the square root of 256 is 16.


Let us show two examples of how these relationships help us to calculate, on
the one hand, the square root of 256 and, on the other, an approximation of the
square root of 2.

[Figure, two panels: ‘Calculation of the square root of 256’ (left) and ‘Calculation of the square root of 2’ (right). In each panel, the number whose root we want to calculate leads to its logarithm, the logarithm is halved, and the half leads to the root sought.]


To calculate the square root of 256, we must first search for that number in the
bottom row, as is shown in the left part of the above figure. We then turn to the top
row (we get 8), and we calculate half its value (we get 4). Finally, we return to the
lower row and arrive at the number 16, which is the square root sought. Indeed,
$\sqrt{256} = 16$. The advantage of the method, of course, is that calculating half a number
is much simpler than finding its square root.

Now let’s look at the second example. To obtain $\sqrt{2}$ (right part of the above
figure), we first look for the number 2 in the lower row. We then go to the upper
row (that is, we search for its logarithm) and we arrive at 1. We then find half of this
number and we get 0.5. The number sought is, then, the antilogarithm of 0.5. But
here a problem arises because, as can be seen in the first table of this chapter,
the antilogarithm of 0.5 is not in the table. Consequently, we are obliged to find
an approximate value of this antilogarithm, which we do as follows: in the upper row, 0.5
is the midpoint between 0 and 1, so from the lower row we select, by way
of approximation, the midpoint between 1 and 2. This tells us that the square root
of 2 is approximately 1.5.

As the exact value of $\sqrt{2}$ begins with 1.4142..., we can say that the result
obtained is reasonably correct.
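The procedure described above can be sketched in Python. This is only an illustration under simple assumptions: the table is the base 2 table of this chapter, the number must be an entry of the lower row, and the names `antilogs` and `table_sqrt` are hypothetical.

```python
import math

antilogs = [2 ** n for n in range(11)]   # lower row; the index is the logarithm

def table_sqrt(number):
    """Approximate the square root of a table entry by halving its logarithm."""
    half = antilogs.index(number) / 2
    below = math.floor(half)
    if half == below:                     # the half appears in the upper row
        return antilogs[below]
    # Otherwise, take the midpoint of the two neighbouring antilogarithms.
    return (antilogs[below] + antilogs[below + 1]) / 2

for number in (256, 2):
    print(number, table_sqrt(number), math.sqrt(number))
```

Printing the exact square root next to the table result makes it easy to see how far the midpoint estimate drifts from the true value.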

With the same procedure, how could we calculate the square root of 512? The
logarithm of 512 is 9, half of which is 4.5. As 4.5 is the midpoint between 4 and 5, the
approximation we get for the square root of 512 is the midpoint between 16 and
32, which is 24 (see the figure below). In other words, our calculation tells us that
$\sqrt{512}$ is approximately 24.

[Figure: calculation of the square root of 512.]


However, the square root of 512 is 22.627... So why, in this case, is the
approximation not as good as when we calculate $\sqrt{2}$? The explanation is that by
equating the midpoint of two numbers in the upper row with the midpoint of
the corresponding numbers in the lower row, we are implicitly assuming that the
two sequences grow at the same rate. But that is not true, as it is evident that the
lower sequence grows faster than the upper one, and that is why we only get an
approximate result, which may or may not be close to the exact one. The above examples show
the following. As was the case with the calculation of $\sqrt{2}$, if we work in a section
of the table where the antilogarithms do not yet have much separation from one
another, the error that is made by equating the midpoint between two numbers in
the upper row with the corresponding midpoint in the lower row is not very large, so we
get a very acceptable approximation. On the other hand, in a part of the table in which
the antilogarithms are already significantly spaced out, the error made in
the approximation will increase significantly, as occurred in our example with the
calculation of $\sqrt{512}$.

JOBST BÜRGI (1552-1632), A SKILFUL MATHEMATICIAN

Jobst Bürgi was born in the Swiss city of Lichtensteig on 28 February 1552. Not only was he an
astronomer and mathematician, he also excelled in the development of precise scientific instruments,
a skill that he began to hone between 1570 and 1574 when he helped build the astronomical clock
of the Cathedral of Our Lady of Strasbourg. This clock, whose construction had begun in 1547,
indicated the movement of the planets on an astrolabe and included a perpetual calendar.

In 1579, Wilhelm IV, Prince of Hesse-Kassel (an independent state now part of Germany),
commissioned Bürgi to design and build astronomical instruments to verify Copernicus’ heliocentric theory.
Wilhelm himself was a notable astronomer and, thanks to the instruments created by Bürgi,
he managed to make remarkably precise observations of the positions of fixed stars. It was
then that Bürgi conceived the idea of logarithms, although, as has already been mentioned, he
did not publish his finding until some 40 years later.

In 1601, Bürgi travelled to Prague to work as an assistant to Johannes Kepler, and in this
context, he privately communicated his logarithm-based calculation method. As a result,
Kepler’s laborious astronomical calculations were so simplified that he persuaded Bürgi to
make the method public so as to benefit all astronomers in Europe.

Bürgi spent the rest of his life in Prague, although he frequently visited Hesse-Kassel, where
he died on 31 January 1632. His epitaph defines him as an “ingenious designer of measurement
instruments and celestial vaults, the builder of the most accurate clocks in the 16th
century and an inventor of logarithms”.
Bürgi first discerns the number e


Both Napier and Bürgi realised that it was necessary to choose a geometric
sequence that did not grow very quickly, so that the distance between two consecutive
terms increased at a reasonably slow rate. In the previous example, we
used a geometric ratio of 2; if the ratio was 3, growth would be even faster, rather
than slower. We must then look for a geometric sequence with a ratio less than 2.
The number 1 would be a bad choice, because if we multiply each time by 1, we would
always get the same number (1, 1, 1, 1, ...), and the sequence generated would have
no use. Therefore, we must choose a ratio that is very close to the number 1.

While Napier used a ratio that was slightly less than 1 (he took the number
0.9999999), Bürgi took one slightly larger than 1, and thanks to this, he got a first
glimpse of the number e. And yet, the number e has sometimes been called the
Napier number, but never the Bürgi number, which historically would have been more
appropriate.

Specifically, Bürgi used the number 1.0001 as the ratio for his geometric progression,
whilst to build his arithmetic progression, he added 0.0001 each time. The first
terms of these two sequences were

0    0.0001    0.0002        0.0003        0.0004
1    1.0001    $1.0001^2$    $1.0001^3$    $1.0001^4$

In Bürgi’s logarithm table, the geometric sequence had a ratio of
1.0001 and the arithmetic sequence was obtained by adding 0.0001.


Strictly speaking, in Bürgi’s original table, all the numbers in the geometric
progression were multiplied by

$0.00000001 = \frac{1}{10^8}$

to slow their growth even further. In our explanation, we are omitting this factor,
a decision that simplifies the calculations without affecting the essential idea.

As stated above, the first table in the chapter was a table of base 2 logarithms.
With this same idea in mind, to what base does Bürgi’s table correspond? If we refer to
the base as b, we have, for example, that $b^{0.0001} = 1.0001$, from which it follows that
$b = 1.0001^{10{,}000}$, that is,

$b = \left(1 + \frac{1}{10{,}000}\right)^{10{,}000}$

which approximately amounts to 2.7181459268. As we shall soon see, inadvertently
and indirectly, Bürgi had just introduced the first known approximation of the
number e.
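The value of this base can be reproduced with a short calculation. The sketch below uses Python’s `decimal` module so that the repeated multiplications are not disturbed by ordinary floating-point rounding; the precision of 40 digits is an arbitrary choice made for this example.

```python
from decimal import Decimal, getcontext

getcontext().prec = 40  # working precision (arbitrary, but generous)

# b ** 0.0001 == 1.0001 implies b == 1.0001 ** 10_000
burgi_base = Decimal("1.0001") ** 10_000
print(round(burgi_base, 10))
```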


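How exactly is the number e defined? Suppose that we calculate successively

$\left(1+\frac{1}{1}\right)^1, \left(1+\frac{1}{2}\right)^2, \left(1+\frac{1}{3}\right)^3, \left(1+\frac{1}{4}\right)^4, \left(1+\frac{1}{5}\right)^5, \left(1+\frac{1}{6}\right)^6, \ldots$

In other words, in the formula $\left(1+\frac{1}{n}\right)^n$, we are replacing the letter n by the numbers 1, 2, 3, 4, 5, 6, ...
Let’s look at the results obtained in the following table (rounded to ten
decimal places).

[Table: values of $\left(1+\frac{1}{n}\right)^n$ for successive values of n.]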
As the value of n increases, the results obtained become closer and closer to a
specific number, a number that starts with 2.71828182845904523536... This
is precisely the number e, which we can define as the value approached by the results of

$\left(1+\frac{1}{n}\right)^n$

when n takes the successive values 1, 2, 3, 4, 5, 6, ...
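To see this behaviour numerically, the following sketch evaluates the formula for a handful of values of n. The choice of values and the function name `compound_term` are assumptions made for illustration; exact fractions are used so that only the final conversion to a decimal introduces rounding.

```python
from fractions import Fraction

def compound_term(n):
    """Return (1 + 1/n) ** n as an exact fraction."""
    return (1 + Fraction(1, n)) ** n

for n in (1, 2, 3, 4, 5, 6, 10, 100, 1000):
    print(f"{n:>5}  {float(compound_term(n)):.10f}")
```

The printed values increase with n while staying below e, which is the pattern the text describes.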


If we think of Bürgi’s table as a table of logarithms, its base is

$\left(1+\frac{1}{n}\right)^n$

with n = 10,000, that is, it is a good approximation of the number e.


Napier, as we have already mentioned, used the number $0.9999999 = 1 - \frac{1}{10^7}$,
while his arithmetic sequence was found by adding $\frac{1}{10^7}$. A similar reasoning to
that we have used above shows us that the base of Napier’s logarithms was
$\left(1 - \frac{1}{10^7}\right)^{10^7}$, which is 0.367879422777 (rounded to 12 decimal places). As we shall
see, whilst Bürgi’s base is a good approximation of e, Napier’s is a good approximation
of the number $\frac{1}{e}$. (Both multiplied their geometric sequence by a factor
that further reduced its ‘speed’; in this case, that factor, which we have omitted in
the calculation, was $\frac{1}{10^7}$.)
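Napier’s base can be checked in the same spirit. This is a sketch only: it uses Python’s `decimal` module with an arbitrarily chosen precision of 40 digits, because raising a number so close to 1 to the power of ten million in ordinary floating point can disturb the last few decimal places.

```python
from decimal import Decimal, getcontext

getcontext().prec = 40  # enough digits to make rounding error negligible here

napier_base = (1 - Decimal(1) / 10**7) ** (10**7)
inverse_e = 1 / Decimal(1).exp()

print(round(napier_base, 12))   # base of Napier's logarithms
print(round(inverse_e, 12))     # 1/e, for comparison
```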

Was Bürgi aware that, implicitly, he was getting a good approximation of one
of the fundamental constants of nature? Probably not. As with so many discoveries
(scientific or otherwise), it was other mathematicians who demonstrated the
importance of the finding.


Area under a hyperbola 


Napier died a few years after making his method of logarithms public, and his 
work on this subject was continued by the English mathematician Henry Briggs 
(1561-1630). 
LOGARITHM TABLES

The image shows a fragment of a typical table of decimal logarithms.

[Image: fragment of a four-figure table of decimal logarithms with its ‘Proportional Parts’ columns.]

For example, the entry 0453 indicates that the decimal logarithm of 11.1 is approximately
1.0453 (the 1 in front of the decimal point is understood and does not appear in the table). The
last three columns, grouped under the heading ‘Proportional Parts’, help to calculate
approximately the antilogarithms for numbers that are not in the table.
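One row of such a table can be regenerated with a few lines of Python. This sketch assumes the usual four-figure layout, in which each row lists the first four decimals of the logarithm for ten consecutive three-digit numbers; the function name `table_row` is hypothetical.

```python
import math

def table_row(first_two_digits):
    """Four-figure entries of log10 for d.d0 to d.d9 (for 11: 1.10 to 1.19)."""
    entries = []
    for last in range(10):
        value = (first_two_digits * 10 + last) / 100   # for example 1.11
        entries.append(f"{round(math.log10(value) * 10_000):04d}")
    return entries

print(table_row(11))            # the second entry corresponds to 1.11
print(1 + math.log10(1.11))     # log10(11.1) = 1 + log10(1.11)
```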


Briggs understood that to execute difficult calculations, it was more convenient
to work with base 10 logarithms, because 10 is the base of our numbering
system. Thus, in 1624, Briggs published what can be considered the first modern
table of logarithms. It contains neither a geometric nor an arithmetic sequence,
but all numbers between 1 and 20,000 and between 90,000 and 100,000 with
their respective decimal logarithms (that is, what are usually called base 10
logarithms) calculated to 14 exact decimal places. It must be said that Briggs’
idea was completely correct. In fact, base 10 logarithms were used up until the
1970s to perform complex calculations. Throughout this time, tables similar to
those published by Briggs in 1624 were used by everyone working in
a technical or scientific discipline. (Even today, scientific calculators often have
a key that calculates the decimal logarithm of a number x, an operation that is
usually written log(x), omitting the subscript 10.)

But let us return to the story of the discovery of e. Before continuing,
we have to discuss the coordinates of a point.

In a similar way to what we see on a map, the position of a point on a plane
can be described by two coordinates, the abscissa and the ordinate, which are
equivalent, in the case of a map, to longitude and latitude. To establish
these coordinates, as shown in the figure below, two coordinate axes, one
horizontal and one vertical, are plotted. The abscissa of the point is obtained
by projecting it onto the horizontal axis and its ordinate by projecting it onto the
vertical axis.


The abscissa of point P is 2 and its ordinate is 3. 


Around 1630, the French mathematician and philosopher René Descartes
(1596-1650) and the brilliant amateur mathematician and lawyer Pierre de Fermat
(1601-1665) each conceived the idea that, based on a coordinate system, it was possible
to describe a curve (a ‘geometric object’) by a formula (an ‘analytical object’).
This idea was the beginning of analytical geometry, a fertile branch of mathematics
that opened new and very deep paths into this science; its importance has not
diminished to this day.

Although Fermat conceived the idea a few years earlier than Descartes, it was
the latter who published it first and, for that reason, many history books attribute
the creation of analytical geometry to Descartes. Moreover, the coordinate system,
as shown in the above figure, is usually called a Cartesian system, a word that comes
from Cartesius, the Latin version of Descartes’ surname.

The abscissa of a point (the first coordinate) is usually indicated by the letter x,
whilst the other coordinate is usually indicated by the letter y. By way of example,
let us take the formula $xy = 1$. How does this formula describe a curve? The formula
$xy = 1$ tells us that we are considering the points for which the product of the two
coordinates is 1. Some of these points are, for example,
$(1, 1)$, $\left(2, \frac{1}{2}\right)$ and $\left(\frac{1}{4}, 4\right)$,
as in all cases the product of the two coordinates is equal to 1 (for simplicity, we
limit ourselves to those points that have positive coordinates, a restriction that
makes historical sense because, in the first half of the 17th century, most mathematicians
rejected the existence of negative numbers). Now, if we could mark in the
system of Cartesian axes all the infinitely many points that fulfil the relationship $xy = 1$, we
would obtain, as we have said before, a curve, in this case a hyperbola:

[Figure: graph of the hyperbola.]

Hyperbola defined by the relationship xy = 1.


In 1661, the Dutch mathematician and physicist Christiaan Huygens asked
himself the question: if we choose a number a > 1 on the horizontal axis, what
will be the area of the region between the hyperbola and the x axis and between
the numbers 1 and a, that is, the area shown in the following figure?

[Figure: the region under the hyperbola between 1 and a.]

A region between the hyperbola and the x axis, between the numbers 1 and a,
the area of which Huygens calculated in 1661.
CHRISTIAAN HUYGENS (1629-1695), ASTRONOMER AND WATCHMAKER

Christiaan Huygens was born in The Hague in The Netherlands on 14 April 1629. He was a
mathematician, astronomer and physicist, and he also studied law. In 1654, he created a method for
producing high-quality lenses, and with them, he built the telescope that allowed him to detect,
in 1655, the moon Titan, the first satellite of Saturn to be discovered. The following year, he
discovered that the form which Galileo had noted in the vicinity of Saturn was, in fact, a ring
circling the planet, and in 1659, he published a work on the movement of the ring. Moreover, as
his astronomical observations required very precise measurements of time, Huygens designed and
built the first precision pendulum clock, whose patent allowed him to earn a significant
amount of money.

Portrait of Huygens, by the Dutch painter Caspar Netscher.

As a mathematician he was one of the precursors of calculus. In 1657, he wrote the work De
Ratiociniis in Ludo Aleae (Reasoning in Games of Chance), which was the first work published
on the theory of probability.

As a physicist, he studied the movement of the pendulum, the dynamics of uniform circular
motion and the theory of collisions of elastic bodies, in which he highlighted some of René
Descartes’ errors. However, he is most remembered for his wave theory of light of 1678, in
which he stated that light propagates as a wave. For centuries, Huygens’ theory was in
conflict with the corpuscular theory formulated by Newton, which affirmed that light is formed
of particles. Since the beginning of the 20th century, it has been understood that, in reality, the
true nature of light implies a combination of both theories. Christiaan Huygens died in The
Hague on 8 July 1695.
The general problem of finding the area of the region between any curve
and the horizontal axis was solved some 20 years later, independently, by Isaac
Newton (1643-1727) and Gottfried Wilhelm Leibniz (1646-1716). The solution
of this problem is one of the pillars of calculus, a fundamental branch
of mathematics created by these two notable scientists. But Huygens, although
he did not solve the general problem, did find, as we stated above, how to solve
it in the case of the hyperbola. At that time, the use of decimal logarithms for
making calculations had already become widespread, and Huygens expressed the area
in terms of these logarithms. Specifically, the Dutch mathematician established
that the area of the region in the previous figure is equal to the base 10 logarithm
of a divided by a certain constant, which Huygens calculated to 17
exact decimal places, and which begins with 0.434294... It is not clear whether
Huygens was able to see the relationship between this constant and the number
e (which was not yet called that, although it had already been identified as the
base of Bürgi’s logarithms), but the truth is that this relationship does exist
and that Huygens’ constant is nothing other than the base 10 logarithm of e.
In fact, today it is known that:


The area under the hyperbola between 1 and a (with a > 1) = $\frac{\log(a)}{\log(e)}$

It is also currently known that $\frac{\log(a)}{\log(e)}$ is exactly the logarithm in base e
of a, that is, $\frac{\log(a)}{\log(e)} = \log_e(a)$, which is often called the ‘natural logarithm of a’ (or
sometimes, as we have already said, the ‘Napierian logarithm of a’) and is written
as $\ln(a)$. The phrase ‘natural logarithm’ to refer to the base e logarithm (although the number e
was yet to have that name) was first used in 1668 by the
mathematician Nicholas Mercator (1620-1687) in his work Logarithmotechnia.
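This relationship can be checked numerically. The sketch below estimates the area under $y = 1/x$ between 1 and a by adding up thin rectangles (the midpoint rule, a modern technique that is not Huygens’ own method) and compares it with the two logarithmic expressions. The function name, the value a = 5 and the number of strips are arbitrary choices for illustration.

```python
import math

def area_under_hyperbola(a, strips=100_000):
    """Midpoint-rule estimate of the area under y = 1/x between 1 and a."""
    width = (a - 1) / strips
    return sum(width / (1 + (k + 0.5) * width) for k in range(strips))

a = 5.0
print(area_under_hyperbola(a))               # numerical estimate of the area
print(math.log10(a) / math.log10(math.e))    # log(a) / log(e)
print(math.log(a))                           # natural logarithm ln(a)
print(math.log10(math.e))                    # the constant that begins 0.434294...
```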

And so Huygens once again found, indirectly, the number e. But it was Leibniz,
in a letter he wrote to Huygens in 1690, who first attributed a specific letter
to this number, namely the letter b. It would still be a few decades before it
received the name by which we know it today. But what is important is that it
had already been discovered and defined, and although the number e first arose
linked to logarithms, it was also a consequence of the development of mercantilism.
Indeed, the first formal definition of e came from a problem related to the
calculation of bank interest.
THE SLIDE RULE

The slide rule is an instrument designed for complex calculations and was widely used from its
inception in the first half of the 17th century up until the development of portable electronic calculators
towards the end of the 20th century. Its operation was based on logarithms, so it could be
considered a ‘mechanical logarithm table’.

It is not definitively known who its creator was. Whilst some historians attribute its invention
to the English mathematician Edmund Wingate (1596-1656), others believe it was invented
by his compatriot the Reverend and mathematician William Oughtred (1574-1660).


Finally! Jacob Bernoulli discovers the number e 


Let us begin with a hypothetical example. Let’s say that we have 100 pounds and 
that we put them in a bank with compound interest of 100% per annum, and sup- 
pose, in addition, that the capitalisation of the interest is also annual. This means 
that, after a year, our initial capital of 100 pounds will have accrued another 100 of 
interest (100% of 100), and we will then have 200 pounds. 

But what will happen if the capitalisation is semi-annual? In that case, at the
end of the first half of the year, 50% interest will be added to our initial capital (half
of the total rate, as only half of the year has passed). That is, at the end of the first
semester, we will have 150 pounds and, for the second semester, interest will be
calculated not on the initial 100, but on the accumulated 150. The interest for the
second semester will then be 75 pounds (50% of 150), and at the end of the year,
we will have 225 pounds, as you can see in the table below:


Period            Starting capital   Interest (50%)   Final capital
First semester    100                50               150
Second semester   150                75               225


We note that adding 50% to the accumulated capital is equivalent to multiplying it
by $\left(1 + \frac{1}{2}\right)$; and as the capitalisation is biannual, we will have
accrued interest twice in the year. Therefore, in the previous example, the capital has
been multiplied by $\left(1 + \frac{1}{2}\right)^2$. Indeed:
$100\left(1 + \frac{1}{2}\right)^2 = 225$.

If the capitalisation is quarterly, as there are four quarters in a year, we shall
accumulate an interest of 25% on four occasions and after one year, the final capital
will be $244.14 = 100\left(1 + \frac{1}{4}\right)^4$. If the capitalisation is every two
months, the initial capital will be multiplied by $\left(1 + \frac{1}{6}\right)^6$, and if
the capitalisation is monthly, the final capital will be obtained by multiplying the
initial 100 pounds by $\left(1 + \frac{1}{12}\right)^{12}$.

But if the capitalisation is minute by minute, given that there are 525,600 minutes
in a year, the starting capital is multiplied by
$\left(1 + \frac{1}{525,600}\right)^{525,600}$, which is approximately 2.7182792426
(rounded to ten decimal places), and we will thus have 271.83 pounds.


JACOB BERNOULLI (1654-1705) AND THE MARVELLOUS SPIRAL


Jacob Bernoulli was born in Basel, Switzerland, on 27 December 1654. His father, a
magistrate and member of the city’s governing council, sent him to the University
of Basel to study philosophy and theology, and Jacob graduated in these disciplines
in 1671 and 1676, respectively. However, at the same time, and against the wishes
of his father, he studied mathematics and astronomy, his true vocations. Perhaps
without knowing it himself, Jacob set off on a path that would be followed by many
other members of his family. His brother Johann, his three children and two of his
grandchildren were all mathematicians or physicists. Jacob made fundamental
contributions to differential calculus and mathematical physics. His brother worked
in the same fields, which often led to bitter discussions about the priority of certain
discoveries.

Among many other contributions, Jacob was the first to study in depth the properties of the
logarithmic spiral, a curve frequently found in nature, for example in the structure of certain
snails, some cyclones and also the forms of some galaxies. Jacob called this curve the Spira
mirabilis, that is, ‘the marvellous spiral’, and was so fascinated with it that he asked for it to
be engraved on his tombstone along with the following inscription: ‘I shall arise the same,
although changed’. Jacob Bernoulli died in Basel on 16 August 1705.

[Image: Jacob Bernoulli's tombstone inscription.]


In short, if the capitalisation occurs n times per year, then the initial
capital will be multiplied by $\left(1 + \frac{1}{n}\right)^n$. But what will happen if the
capitalisation is continuous, that is, if the interest is calculated at each instant of
time? In this case, the initial capital will be multiplied by the number that the
expression $\left(1 + \frac{1}{n}\right)^n$ approaches when n becomes immensely large
(when ‘n reaches infinity’). According to the definition given a few pages back, that
value is precisely the number e. We note that, as e is worth approximately
2.718281828459..., then because of rounding, the continuous capitalisation will leave
us with a final capital of 271.83 pounds, the same as in the ‘minute by minute’
capitalisation.
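
The compound interest figures quoted above can be reproduced with a few lines of
Python. This is a minimal sketch added for illustration; the function name
`final_capital` and the dictionary of capitalisation schedules are assumptions of the
example, not something taken from Bernoulli.

```python
def final_capital(initial, periods_per_year, annual_rate=1.0):
    """Capital after one year when interest is capitalised periods_per_year times."""
    return initial * (1 + annual_rate / periods_per_year) ** periods_per_year

schedules = {
    "annual": 1,
    "semi-annual": 2,
    "quarterly": 4,
    "every two months": 6,
    "monthly": 12,
    "minute by minute": 525_600,
}

for name, n in schedules.items():
    print(f"{name:>17}: {final_capital(100, n):.2f} pounds")
```

As the number of capitalisations per year grows, the final capital keeps increasing,
but ever more slowly, and it settles around 271.83 pounds.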

The first person to do these calculations was the Swiss mathematician Jacob
Bernoulli, who, in 1683, identified that the number by which the initial capital was
to be multiplied is the value approached by $\left(1 + \frac{1}{n}\right)^n$ when n
tends to infinity. This was the first time in the history of mathematics that a number
was defined as the result of an infinite process of successive approximations.

It is true that in order to calculate the value of π, infinite successive
approximations are also required, but its definition (the ratio of a circle’s
circumference to its diameter) makes no reference to an infinite procedure. That is
what we meant by saying that the definition of e is much more complex than that of π.
In short, although Jacob Bernoulli did not give it a specific name, nor did he
apparently perceive the relationship between this number and the logarithms, we can
say in all fairness that Bernoulli was the true ‘discoverer’ of e.

However, in all the above calculations, in deducing that if the compounding
of interest is continuous, then the initial capital is multiplied by e, we have
assumed that the interest rate is 100% per year. What would happen if it were 20% a
year? In this case, as Bernoulli found, the initial capital must be multiplied by
$e^{0.2}$. If the interest were 35% per annum, the capital would be multiplied by
$e^{0.35}$ and so on. This allows us to define, in a natural way, a function, one of
the most important concepts in mathematics and an essential tool when describing
physical, biological or any other phenomena.
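
Why does a rate of 20% lead to e raised to 0.2? With an annual rate r capitalised n
times per year, each period adds r/n, so the capital is multiplied by (1 + r/n)^n, and
this expression approaches e^r as n grows. The following Python sketch, added here as
an illustration, compares both quantities for the rates mentioned in the text; the
function name `growth_factor` and the choice of one million periods are assumptions of
the example.

```python
import math

def growth_factor(rate, periods_per_year):
    """Factor applied to the capital after one year at the given annual rate."""
    return (1 + rate / periods_per_year) ** periods_per_year

for rate in (1.0, 0.2, 0.35):
    discrete = growth_factor(rate, 1_000_000)  # a very large number of periods
    continuous = math.exp(rate)                # e raised to the rate
    print(f"rate {rate:.2f}: {discrete:.6f} vs e^rate = {continuous:.6f}")
```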

A function is, by definition, any rule that assigns to each number x, in a clear and
well-defined way, another number y, as shown in the following figure:


x  --(rule)-->  y          For example:  x  --(rule)-->  $e^x$


A function is a rule that assigns to each number x
exactly one single number y.


Here we have a function that assigns to each number x the number $e^x$. That is,
to the number 0.2 is assigned the number $e^{0.2}$, and so on. This function is called
the exponential function and its fundamental role in mathematics is explained in
chapters 3 and 5.

The first to explicitly analyse the exponential function was Johann Bernoulli
(1667-1748), Jacob’s brother, who was also a notable mathematician. Johann
published this study in 1697 in his work Principia calculi exponentialium seu
percurrentium, where he not only detailed the properties of this function but also
observed, and he was probably the first to do so, the close relationship between the
exponential function and the natural logarithm, which, as we have already said, is the
base e logarithm.

What is this relationship? Let us recall that the natural logarithm of a number
A, written ln(A), is the exponent to which we must raise the number e to obtain
A. For example, ln(e) = 1 because $e^1 = e$ (for the result to be e, the exponent must
be 1) and ln(1) = 0 because $e^0 = 1$ (for the result to be 1, the exponent must be 0).

We note that the exponential function assigns to the number 0 the number 1,
because $e^0 = 1$, and precisely because of this, the natural logarithm of the number 1
is 0.


exponential function:  1  -->  e          natural logarithm:  e  -->  1


If the exponential function assigns the number y to the number x,
the natural logarithm does exactly the inverse.


In summary, if the exponential function assigns the number y to the number x, then
the natural logarithm (which is also a function) does exactly the opposite: it assigns
the number x to the number y. It is thus said that the natural logarithm is the inverse
function of the exponential (and vice versa, that the exponential is the inverse
function of the natural logarithm).
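
The inverse relationship between the two functions can be seen directly with Python's
standard `math` module, where `math.exp` is the exponential function and `math.log`
(with a single argument) is the natural logarithm. The sample values of x below are
chosen only for illustration.

```python
import math

for x in (0.0, 1.0, 0.2, 0.35):
    y = math.exp(x)     # exponential function: x -> e**x
    back = math.log(y)  # natural logarithm: y -> x
    print(f"x = {x:.2f}  e^x = {y:.6f}  ln(e^x) = {back:.6f}")
```

In every row the last column returns to the starting value of x, up to floating-point
rounding.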


The name of this extraordinary number 


But not even Johann Bernoulli, who so extensively studied the exponential function,
gave the number the name e. The person who assigned it the letter we know
today was the great Swiss mathematician Leonhard Euler (1707-1783), who also
popularised the use of the letter π to designate the ratio of a circle’s circumference
to its diameter. He also introduced many other mathematical notations that are
still used today. Specifically, Euler first used the letter e to designate the number
of which we are speaking in a letter sent to the German mathematician Christian
Goldbach (1690-1764) in 1731. In this letter, written in Latin and dated 5 November,
Euler tells Goldbach how he managed to solve the calculation of the area of the
region located underneath a certain family of curves. In the midst of this procedure,
Euler says that a formula that he is working on must be multiplied by $e^{-2v}$, where,
Euler adds, ‘e denotes the number whose hyperbolic logarithm equals 1’. By saying
‘hyperbolic logarithm’, Euler is speaking of the natural logarithm. The word
‘hyperbolic’ surely refers to the fact that said logarithm is the one that calculates
the area under the hyperbola.


[Image: part of the transcript of the letter in which Leonhard Euler used the name e
for this number for the first time. The legible Latin text includes the remark
‘e denotat hic numerum, cujus logarithmus hyperbolicus est = 1’.]


Euler did not just give the number e the name by which we know it today, he also
studied many of its properties and found several very efficient methods to calculate
it, that is, methods which allow for quick approximations of e with many decimal
places, which were especially useful in the time before electronic calculators.

Chapter 2


How and why we calculate the number e


The number e is involved in a large number of formulae related to biology, 
physics, economics and many other branches of science, as well as in many other 
useful applications. The connection between e and compound interest calculation 
was set out in the previous chapter, and we will now see other situations in which 
this constant plays a prominent role. 

As we already said, the decimal expansion of the number e begins with
2.718281828459... But how many decimal places do you need to work with
in practical applications? In fact, in any reasonable calculation, two or three
digits are enough, and even in the most extraordinary circumstances, ten digits
are more than enough.

Let's say, for example, that we deposited one million pounds in the bank
for a year at the exorbitant interest rate of 100% per year. We already know
that, to find out what the accumulated capital will be after one year, that
initial million pounds must be multiplied by e. If we take the approximate value
of 2.71828183, i.e. if we round to eight decimal places, the calculation
gives us a final accumulated capital of 2,718,281.83 pounds. But if instead
of eight decimal places, we use nine, ten, 11 or any larger number,
the result would be, in practice, exactly the same. Indeed, 2,718,281.828459
pounds, the figure that we obtain working with 12 decimal places, is equal
to 2,718,281.83 pounds because it does not make sense to consider amounts
smaller than a penny.

Nevertheless, in the times before electronic computers, there were people
who spent years calculating hundreds and hundreds of decimal digits of e.
Driven by scientific curiosity, they tried to find patterns or regularities in that
sequence of figures, or perhaps they were simply driven by a ‘sporting’ spirit
similar to that which can drive someone to try to become the fastest runner in
the world.

Determination of calculators: calculating decimal
places of e


The first person to calculate a significant number of digits of e was Leonhard Euler,
who, in 1748, published the first 23 decimal places in his book Introductio in
Analysin infinitorum (Introduction to Infinitesimal Analysis). His record stood until
1853, when the English mathematician William Shanks published the list of the
first 137 decimal places of e. In 1871, he went on to calculate the decimal places
to 205, but committed an error. He got digit number 188 wrong and, unfortunately, all
the figures following it were incorrect. It was not until 1884, two years after his
death, that the error was detected by the American engineer John Marcus Boorman
(1831-1909). After rectifying it, Boorman calculated and published the first
346 decimal places of e and established a record that remained unbroken until the
middle of the 20th century, when the first electronic computers were finally built.

This great invention allowed us to tackle very long calculations with a level of
precision and speed unknown up to this point. Since then, the number of calculated
decimal places of e has grown enormously. In 1949, the famous mathematician
John von Neumann (1903-1957) programmed the ENIAC computer, one of
the earliest electronic computers in history, to calculate the first 2,010 digits of
e, almost six times the Boorman record. In 1961, an IBM 7090 computer (programmed,
curiously, by a certain Daniel Shanks who, as far as is known, was not related to
the famous William Shanks) managed to process more than 100,000 digits. Thirty-three
years later, in 1994, there were already ten million decimal places calculated,
and in 1999, this number reached some 1,250,000,000. The first trillion decimal
digits of e were established in July 2010, and the first five trillion in August 2016.

If, two centuries ago, Shanks and Boorman were able to calculate, on average,
one digit every four or five days, half a century later ENIAC took 70 hours to
calculate the first 2,010 digits of e, at an average of one digit every two minutes. But
the computer that calculated the trillion digits of e in 2010 needed only 224 hours to
complete the task. Do you know what this means? An average of 1,240,000 digits
per second!

WILLIAM SHANKS, CALCULATING DECIMAL PLACES FOR A HOBBY


William Shanks was born in Corsenside, United Kingdom, on 25 January 1812. He was a
professor of mathematics, and spent much of his free time doing long and laborious
calculations. The most famous of these is the computation he made of the first 707
decimal places of π, work that he completed in 1873 after almost 15 years of work. Just
as with the calculation of the digits of e, in the computation of the decimal places of
the number π he too made an error. Specifically, the error was at digit number 528, so
every subsequent digit proved to be erroneous. But Shanks, just as happened with the
error at digit number 188 of e, died before the error was discovered. The mathematician
also published a table with all prime numbers between 1 and 60,000 and calculated the
logarithms of 2, 3, 5 and 10 with 137 decimal places. William Shanks died in
Houghton-le-Spring, United Kingdom, on 17 June 1882.


But how are the digits of e calculated? What is the formula used to obtain them?

We recall that, as we have seen, the number e is defined as the value approached by
$\left(1 + \frac{1}{n}\right)^n$ when n is successively replaced by 1, 2, 3, 4, 5, ...
The higher the value of n, the closer the result will be to the true value of e,
although it will never reach it exactly. Therefore, to obtain a good approximation of
e, that is, to calculate a high number of its decimal places, it would suffice to find
the result of $\left(1 + \frac{1}{n}\right)^n$ taking a ‘sufficiently large’ value of n.

Although this is correct, it is very inefficient; to obtain a large number of digits,
it is necessary to work with huge n values. For example, when n = 1,000,000, we get
$\left(1 + \frac{1}{1,000,000}\right)^{1,000,000} = 2.7182804...$, that is, only five
decimal digits of e (remember that e = 2.7182818...), while if n = 1,000,000,000, only
three more digits are gained.
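
The slow convergence of this expression is easy to observe on a computer. The sketch
below is an illustration added here; the function name `bernoulli_approximation` is an
assumption of the example. Note one design choice: raising 1 + 1/n directly to a huge
power amplifies floating-point rounding errors, so the power is computed through
logarithms with `math.log1p`, which is mathematically the same quantity.

```python
import math

def bernoulli_approximation(n):
    """Value of (1 + 1/n)**n, computed through logarithms.

    Raising 1 + 1/n directly to a huge power amplifies floating-point
    rounding, so exp(n * log1p(1/n)) is used instead.
    """
    return math.exp(n * math.log1p(1.0 / n))

for n in (10, 1_000, 1_000_000, 1_000_000_000):
    approx = bernoulli_approximation(n)
    print(f"n = {n:>13,}: {approx:.10f}  (error {math.e - approx:.2e})")
```

Each time n is multiplied by one thousand, the error shrinks only by a factor of about
one thousand, that is, roughly three more correct digits.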
So to get good, faster approximations of e it is essential to use alternative
formulae. The first to discover one of these formulae was Leonhard Euler, who published
it in his Introductio in Analysin infinitorum, a formula whose content is linked to two
mathematical concepts: factorials and series.

What is the factorial function?


Imagine that three young people named Alejo (A), Bruno (B) and Carolina (C) 
want to take several photographs of themselves in a row, side by side. We want to 
know how many different photographs can be taken, taking into account that we 
are only interested in the order in which the three young people will be located, 
excluding any other circumstance. 

The figure below shows us the answer: the three young people can take six dif- 
ferent photographs. In one of them Alejo is to the left, Bruno is in the centre and 
Carolina to the right. In another Alejo is on the left, Carolina is in the centre and 


Bruno is on the right, and so on, to complete all possibilities. 


ABC ACB CAB
BAC BCA CBA


The six different photographs that can feature three people. 
A = Alejo, B = Bruno, C = Carolina 


We now imagine that another young person, say Diana (D), joins the group
and once again everyone wants to take photographs in a line. How many different
photographs can they take now? One way to find an answer would be, as we did
before, to count every possible photograph one by one. However, as can be seen below,
there is a simpler way of proceeding, based on the resolution of the previous case.


ABC ACB BAC BCA CAB CBA


The different photographs that can feature three people.
A = Alejo, B = Bruno, C = Carolina.


ABCD ACBD BACD BCAD CABD CBAD
ABDC ACDB BADC BCDA CADB CBDA
ADBC ADCB BDAC BDCA CDAB CDBA
DABC DACB DBAC DBCA DCAB DCBA


The 24 different photographs that can feature four people.
A = Alejo, B = Bruno, C = Carolina, D = Diana.

For this, we take each of the six photographs featuring Alejo, Bruno and
Carolina and we see how many new photographs are generated with the
addition of Diana. The ABC photograph produces four new photographs. In
one, Diana is added to the right (and so we get ABCD); in another, Diana is
included between Bruno and Carolina (ABDC); in the next one, she is between
Alejo and Bruno; and in the last one, she is on the left (ADBC and DABC,
respectively). This same reasoning tells us that each of the six photographs in
the previous scenario gives rise to four new ones. Therefore, the total number
of photographs that can be made featuring Alejo, Bruno, Carolina and Diana
is 6 × 4 = 24.

Similarly, if a fifth person was added to the group, say Esteban (E), each of the
previous 24 photographs would give rise to five new ones. For example, ABCD
would produce ABCDE, ABCED, ABECD, AEBCD and EABCD. Consequently,
the total number of photographs for five people is 6 × 4 × 5 = 120.

In fact, we could have applied this technique from the beginning based on the
immediately previous case. That is, if Alejo is alone, he could only take a single
photograph; if Bruno is added, he can be placed to the left or right of Alejo, so
the number of photographs increases to 1 × 2 = 2. If Carolina now joins them, she
can be placed to the right, in the centre or to the left of Alejo and Bruno, so each
photograph generates three new snapshots and the total becomes 1 × 2 × 3 = 6. So
for four people, the quantity of photographs is 1 × 2 × 3 × 4 = 24, and for five it is
1 × 2 × 3 × 4 × 5 = 120.

In short, if we have n objects or persons, the number of different ways in which
they can be placed in line or, as they say, the number of different ways in which they
can be permuted, is calculated as 1 × 2 × 3 × ... × n, that is, the product of all
integers between 1 and n. To abbreviate, this product is indicated as n! and is called
‘n factorial’ or the ‘factorial of n’. In this way, for example:


1! = 1
2! = 1 × 2 = 2
3! = 1 × 2 × 3 = 6
4! = 1 × 2 × 3 × 4 = 24
5! = 1 × 2 × 3 × 4 × 5 = 120
6! = 1 × 2 × 3 × 4 × 5 × 6 = 720
7! = 1 × 2 × 3 × 4 × 5 × 6 × 7 = 5,040

So, if we had the digits 1, 2, 3, 4, 5, 6 and we wondered how many numbers
can be formed using each digit exactly once, as in the case of 213465 or 561243, the
answer is 6! = 720.

On the other hand, it is useful to define a value for 0!. Because $\frac{4!}{3!} = 4$,
$\frac{3!}{2!} = 3$ and $\frac{2!}{1!} = 2$, to continue with regularity, it is also
appropriate that $\frac{1!}{0!} = 1$; so for this last equation to be true, 0! must
equal 1, which is very convenient because there is only one way to take a photograph
with 0 people!
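
The counting argument can be checked by listing the arrangements one by one. The
following Python sketch, added for illustration, uses `itertools.permutations` from
the standard library to enumerate the photographs and compares the count with
`math.factorial`; the variable names are choices made for the example.

```python
from itertools import permutations
from math import factorial

people = "ABCD"  # A = Alejo, B = Bruno, C = Carolina, D = Diana

for size in range(1, len(people) + 1):
    group = people[:size]
    photographs = ["".join(order) for order in permutations(group)]
    # Counting the arrangements one by one agrees with the factorial.
    print(f"{size} people: {len(photographs)} photographs, {size}! = {factorial(size)}")

print("0! =", factorial(0))  # a single (empty) arrangement
```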

A distinctive characteristic of factorials is that they grow very rapidly. For
example, as we have seen, 5! is 120 while 10! is worth 3,628,800; 20! is a 19-digit
number, 30! reaches 33 digits and 70! reaches 101 digits. Imagine how cumbersome
it must have been to calculate factorials before electronic calculators were invented.
Fortunately, in 1730, the Scottish mathematician James Stirling (1692-1770) published
the formula that today bears his name and allows us to find, in a relatively simple
way, a very good approximation of n!. Specifically, Stirling’s formula states that:


n! is approximately $\sqrt{2\pi n}\left(\frac{n}{e}\right)^n$


HOW TO CALCULATE THE NUMBER OF 0S AT THE END OF A
FACTORIAL NUMBER


There is a very simple way of knowing how many 0s are written at the end of n!. In order to
do so, the integer part of $\frac{n}{5}$ must be calculated (that is, the division is made and
the digits located after the decimal point are discarded in the result), then the integer part
of $\frac{n}{5^2}$, of $\frac{n}{5^3}$, and so on until the integer part is equal to 0. Finally,
all the numbers obtained are added together. The last result is the number of 0s at the end
of n!.

By way of example, let n = 35; the integer part of $\frac{35}{5}$ is 7, the integer part of
$\frac{35}{25}$ is 1 and thereafter all others give 0. This means that 35! ends with
7 + 1 = eight 0s. Indeed:
35! = 10,333,147,966,386,144,929,666,651,337,523,200,000,000.
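
The rule for counting the 0s at the end of n! (add the integer parts of n/5, n/25,
n/125, and so on) can be sketched in Python and compared with a direct count on the
digits of n!. This is an illustrative sketch; both function names are assumptions of
the example.

```python
from math import factorial

def trailing_zeros(n):
    """Number of 0s at the end of n!, adding the integer parts of n/5, n/25, n/125, ..."""
    count = 0
    power = 5
    while power <= n:
        count += n // power
        power *= 5
    return count

def trailing_zeros_direct(n):
    """Same count obtained by writing out n! and inspecting its digits."""
    digits = str(factorial(n))
    return len(digits) - len(digits.rstrip("0"))

for n in (10, 35, 100):
    print(n, trailing_zeros(n), trailing_zeros_direct(n))
```

The rule works because each final 0 comes from a factor 10 = 2 × 5, and in n! the
factors 5 are always scarcer than the factors 2.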


As can be seen in Stirling’s formula, not only the number e is involved, but also the
no less famous number π. So, how good is the approximation given by Stirling’s formula?
By way of example, the following table compares, for some small values of n, the exact
result of n! with the approximate calculations that result from applying the Stirling
formula (rounded to two decimal places).


[Table: value of n!, Stirling approximation, and percentage, for some small values of n.]


The third column indicates the approximate value as a percentage of the exact
value of n! (for example, 1.92 is 96% of 2). As the table suggests, it can be rigorously
demonstrated that as n increases, the percentage further approaches 100%, but it
never actually reaches it. That is, the Stirling formula gives an approximation that,
in percentage terms, improves with increasing n.
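
A comparison of this kind can be regenerated with a short Python sketch. It is added
here only as an illustration: the values of n in the loop are chosen for the example
and are not necessarily those of the original table, and the function name `stirling`
is likewise an assumption.

```python
import math

def stirling(n):
    """Stirling's approximation: sqrt(2*pi*n) * (n/e)**n."""
    return math.sqrt(2 * math.pi * n) * (n / math.e) ** n

print(f"{'n':>2} {'n!':>10} {'Stirling':>12} {'percentage':>10}")
for n in (1, 2, 3, 5, 8, 10):
    exact = math.factorial(n)
    approx = stirling(n)
    print(f"{n:>2} {exact:>10} {approx:>12.2f} {100 * approx / exact:>9.2f}%")
```

The percentage column climbs towards 100% as n grows while always staying below it.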

The Indian mathematician Srinivasa Ramanujan (1887-1920) found a formula
that provides much more accurate factorial approximations than Stirling’s
formula. Ramanujan’s formula states that:


n! is approximately $\sqrt{\pi}\left(\frac{n}{e}\right)^n\sqrt[6]{8n^3+4n^2+n+\frac{1}{30}}$


By way of comparison, with respect to 5! = 120, the Stirling formula gives the
approximate value of 118.02, and the Ramanujan formula gives approximately 120.00015.
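
The difference in accuracy between the two formulae can be explored with the
following Python sketch, added as an illustration. It assumes the usual statement of
Ramanujan's approximation, sqrt(pi) (n/e)^n (8n^3 + 4n^2 + n + 1/30)^(1/6); the
function names and the sample values of n are choices made for the example.

```python
import math

def stirling(n):
    """Stirling's approximation: sqrt(2*pi*n) * (n/e)**n."""
    return math.sqrt(2 * math.pi * n) * (n / math.e) ** n

def ramanujan(n):
    """Ramanujan's approximation: sqrt(pi) * (n/e)**n * (8n^3 + 4n^2 + n + 1/30)**(1/6)."""
    correction = (8 * n**3 + 4 * n**2 + n + 1 / 30) ** (1 / 6)
    return math.sqrt(math.pi) * (n / math.e) ** n * correction

for n in (5, 10, 20):
    exact = math.factorial(n)
    print(f"{n}! = {exact}")
    print(f"  Stirling : relative error {abs(stirling(n) - exact) / exact:.2e}")
    print(f"  Ramanujan: relative error {abs(ramanujan(n) - exact) / exact:.2e}")
```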

The invention of electronic computers and the widespread popularity of pocket
calculators have caused Stirling’s formula to lose some of its original utility. In fact,
if we wanted to know the value of 25!, it is much easier to find it directly with a
calculator than by using the approximate formula. However, Stirling’s formula still
remains very useful in various branches of mathematics or physics, in which it is
necessary to work with enormously large factorials. In statistical mechanics, for
example, equations sometimes appear that involve the factorial of the number of
particles in a physical system, an amount that can be of the order of $10^{25}$ (1
followed by 25 0s). Factorials of this size can only be handled with the Stirling formula.

It is a little paradoxical that this formula uses the number e to calculate an
approximation of the factorials and that, on the other hand, as we will see later, it is
these same factorials that give us an efficient way of calculating good approximations
of the number e. That is, the number e ‘calculates’ the factorials, which in turn
calculate e.


The series: adding numbers to infinity 


It is not necessary to explain how a finite quantity of numbers is added as in 
the case, for example, for the calculations 2 +4+ — or 76+ —+(—12)+45. 
Solving a sum with several thousand additions is, undoubtedly, very laborious, 
but its difficulty is merely practical and it is free of specific theoretical complex- 
ities. But what would happen if we wanted to calculate the sum of an infinite 
amount of numbers? 

One might think, a priori, that a sum of these characteristics should result in
an infinite value, but this is not always the case. For example, in the following
figure, we see a square of side 1 that has been divided infinitely many times.
First, it has been split in half, then one of those halves has been cut in two and
so on.


Graphical demonstration of 1/2 + 1/4 + 1/8 + 1/16 + 1/32 + ... = 1.

In the image, within each part, its area has been noted. Given that the total area
is 1, we therefore have
$\frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{32} + \cdots = 1$.

In mathematical language, the sum of an infinite amount of numbers is called a
series. When, as in the case of the previous example, the result of the sum is a
number (i.e. when the result is finite), the series is said to be convergent. But there
are series that effectively give an infinite result. This happens, for example, with
1 + 2 + 3 + 4 + 5 + ... The infinite result of this sum is written like this:
1 + 2 + 3 + 4 + 5 + ... = ∞, and in this case, it is often said that the series is divergent.


Now, a series always implies a process of successive approximations. This means
that when we state that
$\frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{32} + \cdots = 1$, we are
actually saying that if we calculate successively...

$\frac{1}{2} = 0.5$
$\frac{1}{2} + \frac{1}{4} = 0.75$
$\frac{1}{2} + \frac{1}{4} + \frac{1}{8} = 0.875$
$\frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} = 0.9375$
$\frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{32} = 0.96875$

...the results obtained will be getting closer and closer to 1. In the same sense, the
fact that 1 + 2 + 3 + 4 + 5 + ... = ∞ means that if we calculate...

1 + 2 = 3
1 + 2 + 3 = 6
1 + 2 + 3 + 4 = 10

...the results will become as large as you want them to be.
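
The two behaviours, convergent and divergent, can be observed by computing partial
sums. The following Python sketch is an illustration added here; it uses exact
fractions for the first series so that no rounding is involved, and the number of
terms (ten) is chosen only for the example.

```python
from fractions import Fraction
from itertools import accumulate

# Partial sums of 1/2 + 1/4 + 1/8 + ... (they approach 1).
halves = [Fraction(1, 2**k) for k in range(1, 11)]
convergent = list(accumulate(halves))
for total in convergent:
    print(f"{str(total):>9} = {float(total)}")

# Partial sums of 1 + 2 + 3 + ... (they grow without bound).
divergent = list(accumulate(range(1, 11)))
print(divergent)
```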
These concepts interest us because, in 1748, Euler showed that the number e
can be calculated as the sum of a series or that, more accurately, e is the sum of the
inverses of all the factorials, which is expressed in mathematical symbols as:

$e = \frac{1}{0!} + \frac{1}{1!} + \frac{1}{2!} + \frac{1}{3!} + \frac{1}{4!} + \frac{1}{5!} + \frac{1}{6!} + \frac{1}{7!} + \frac{1}{8!} + \frac{1}{9!} + \frac{1}{10!} + \cdots$

This means that if we successively calculate $\frac{1}{0!}$,
$\frac{1}{0!} + \frac{1}{1!}$, $\frac{1}{0!} + \frac{1}{1!} + \frac{1}{2!}$, ..., we will
obtain approximations of the number e that will be better each time. In addition,
with this formula, we will approach the exact value of e much quicker than with
the approximations obtained from the expression $\left(1 + \frac{1}{n}\right)^n$. As we
have seen, if we use this last expression to obtain five exact decimal digits for e, we
must use values of n of the order of one million. Using the series, instead, you get six
accurate decimal digits simply by adding up to $\frac{1}{9!}$. In effect, the sum from
$\frac{1}{0!}$ to $\frac{1}{9!}$ gives 2.718281525573..., whose first six decimal digits
match the exact value of e. In fact, nowadays, computers allow us to easily verify that
adding the first 120 terms of the series gives 200 exact decimal digits. As an example,
the following table shows the result of the first partial sums:


[Table of the first partial sums of the series. The entries that can still be read are
2.7182539683 (sum up to 1/7!) and 2.7182787698 (sum up to 1/8!).]
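
The partial sums of the series can be recomputed with the following Python sketch,
added as an illustration. Exact fractions are used for the sums and converted to
decimals only for printing; the function name `partial_sum` is an assumption of the
example.

```python
from fractions import Fraction
from math import factorial, e

def partial_sum(last):
    """Exact value of 1/0! + 1/1! + ... + 1/last! as a fraction."""
    return sum(Fraction(1, factorial(k)) for k in range(last + 1))

for last in range(1, 11):
    print(f"up to 1/{last}!: {float(partial_sum(last)):.10f}")
print(f"math.e        : {e:.10f}")
```

Comparing each row with the last line shows how quickly the correct digits of e
accumulate.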


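It was this formula that Euler, Shanks and Boorman used to find digits of e.

But this series formula can also extend to the powers of e (which we discussed in
the previous chapter, when we set out the example calculating compound interest).
Specifically, Euler showed that, where x is any number:

$e^x = \frac{x^0}{0!} + \frac{x^1}{1!} + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \frac{x^5}{5!} + \frac{x^6}{6!} + \frac{x^7}{7!} + \cdots$

When x = 1, we get the same series for e that we showed before, as $1^n = 1$ whatever
the exponent. For x = 0.2, the expression that it gives us is:

$e^{0.2} = \frac{0.2^0}{0!} + \frac{0.2^1}{1!} + \frac{0.2^2}{2!} + \frac{0.2^3}{3!} + \frac{0.2^4}{4!} + \frac{0.2^5}{5!} + \frac{0.2^6}{6!} + \frac{0.2^7}{7!} + \cdots$

An interesting example arises when taking x = -1:

$e^{-1} = \frac{(-1)^0}{0!} + \frac{(-1)^1}{1!} + \frac{(-1)^2}{2!} + \frac{(-1)^3}{3!} + \frac{(-1)^4}{4!} + \frac{(-1)^5}{5!} + \frac{(-1)^6}{6!} + \frac{(-1)^7}{7!} + \cdots$

But we notice that if n is even then $(-1)^n = 1$, whereas if n is odd then $(-1)^n = -1$.
Further it follows that $e^{-1} = \frac{1}{e}$. Therefore, the previous series becomes:

$\frac{1}{e} = \frac{1}{0!} - \frac{1}{1!} + \frac{1}{2!} - \frac{1}{3!} + \frac{1}{4!} - \frac{1}{5!} + \frac{1}{6!} - \frac{1}{7!} + \cdots$

That leads to the curious equation:

$\frac{e}{0!} - \frac{e}{1!} + \frac{e}{2!} - \frac{e}{3!} + \frac{e}{4!} - \frac{e}{5!} + \frac{e}{6!} - \frac{e}{7!} + \cdots = 1$.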
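
The series for the powers of e and the ‘curious equation’ can both be checked
numerically. The following Python sketch is an illustration added here; the function
name `exp_series` and the choice of 20 terms are assumptions of the example.

```python
import math

def exp_series(x, terms=20):
    """Sum of x**k / k! for k = 0, 1, ..., terms - 1."""
    return sum(x**k / math.factorial(k) for k in range(terms))

for x in (1, 0.2, -1):
    print(f"x = {x:>4}: series = {exp_series(x):.12f}, math.exp = {math.exp(x):.12f}")

# The 'curious equation': e/0! - e/1! + e/2! - e/3! + ... = 1
curious = sum((-1) ** k * math.e / math.factorial(k) for k in range(20))
print(f"curious sum = {curious:.12f}")
```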
Gc 3t a Ba 7 


Euler’s formula: mathematics is beautiful 


In his book of 1748, in addition to studying the expression of e as a series, Euler
proposed a formula which, like that of Stirling, simultaneously involved the
numbers e and π. Euler’s famous formula has become so well known that, today, it is
subject to merchandising and can be found emblazoned on shirts, mugs and other
objects. Did you know that in 1988, the readers of the Mathematical Intelligencer
journal voted it the most beautiful mathematical formula in history?

The formula states that: $e^{i\pi} + 1 = 0$.

But what exactly does this equation mean? What does it want to tell us? Let’s
take this step by step. Firstly, we will address the meaning of the symbol i, which
appears in this expression.

The numbers we use regularly (and indeed all the numbers we have so far
mentioned in this book) are so-called real numbers. These include positive and
negative integers, such as 0, 1, 2, 3, 4, -1, -2, -3, ..., and also fractions, decimal
expressions such as 0.25 and 0.233333..., roots such as $\sqrt{2}$, as well as those
already mentioned, e and π.

From ancient times up until the beginning of the 16th century, real numbers
were sufficient to satisfy all the mathematical needs of humanity, whether
practical ones, such as those related to trade, the calendar or architecture, or more
theoretical uses, linked to scientific research. However, around 1530, that
situation began to change completely. The Italian mathematicians Niccolo Fontana
(1500-1557), better known as Tartaglia because of his stammer, and Girolamo
Cardano (1501-1576), discovered the formula for solving so-called cubic
equations, for example, $x^3 - 2x^2 + 9x - 18 = 0$, where the unknown x is raised
to, at most, the third power. Cardano and Tartaglia realised that sometimes the
process of solving an equation required them to calculate the square root of a
negative number, as in the case of the square root of -1. The crucial point is
that this calculation, $\sqrt{-1}$, has no solution within real numbers. Indeed, when
we are looking to find, for example, the square root of 9, we are looking for a
number (which by convention is positive) that squared is equal to 9, and given
that $3^2 = 3 \times 3 = 9$, we say that $\sqrt{9} = 3$. Following the same idea,
calculating $\sqrt{-1}$ would consist of finding a number whose square is equal to -1,
but this is impossible because by squaring a real number, you never get a
negative result, as $1^2 = 1$ and also $(-1)^2 = 1$.

For Tartaglia and Cardano, numbers such as $\sqrt{-1}$ or $\sqrt{-2}$ were non-existent
numbers and they called them imaginary numbers as opposed to real numbers. Tartaglia
and Cardano considered them mere ‘useful fictions’ that appeared in some
intermediate steps of their calculations and then ‘disappeared’ in the final result.
As an example, let’s say that the formula requires us to calculate $3 + \sqrt{-2}$ and
$3 - \sqrt{-2}$ and that, then, these two numbers have to be multiplied together. As
can be seen in the result, $(3 + \sqrt{-2})(3 - \sqrt{-2}) = 9 - (-2) = 11$, the ‘uncomfortable’ imaginary
number $\sqrt{-2}$ no longer appears.
But over time, mathematicians increasingly appreciated the utility of these
imaginary numbers. At the end of the 17th century and the beginning of the 18th 
century, Leibniz and the Bernoulli brothers used them in various types of calculations, 
which is why by the middle of the 18th century, it was gradually accepted that these 
numbers ‘really existed’. 

Leonhard Euler was the first mathematician to systematically work with imaginary
numbers. He studied their properties, analysed formulae that involved them
and even introduced the letter i, which appears in the expression $e^{i\pi} + 1 = 0$, to refer
to, as it still does today, the square root of −1. In other words, Euler was the first to
say that $i = \sqrt{-1}$.


LEONHARD EULER (1707-1783), THE MOST PROLIFIC MATHEMATICIAN

Leonhard Euler was born in Basel, Switzerland, on 15 April 1707 and died in St Petersburg, Russia, on 18 September 1783. Considered the most eminent mathematician of the 18th century and one of the most important of all time, he remains the most published author of mathematical studies. Almost no one in any area of science has exceeded the number of articles produced by Euler, and 50 years after his death, many of his posthumous articles were still being published. He undoubtedly made fundamental contributions not only to mathematics, but also to physics and astronomy. It is difficult to find a subject area in these disciplines in which an Euler formula, Euler constant, Euler function or Euler equation does not appear. And he had such a capacity for work that even in later life, after he became completely blind (as a young man, he lost sight in his right eye due to observing the Sun without protection), Euler went ahead with his calculations. Equipped with an extraordinary memory, Euler dictated them to an assistant, who wrote them down. He died while, during a break, he was playing with his grandchildren. They say his last words were extraordinarily precise: “I’m dying”. Two centuries later, referring to his last moments, the Hungarian mathematician Paul Erdős (after Euler, the second most prolific mathematician in history) would say with admiration: “This was Euler’s last conjecture”. No doubt, it was!
We note that the equation $i = \sqrt{-1}$ implies that, for example, $3 + \sqrt{-2}$ can be
written as: $3 + \sqrt{2(-1)} = 3 + \sqrt{2}\sqrt{-1} = 3 + \sqrt{2}\,i$. Numbers such as $3 + \sqrt{2}\,i$, $2 - i$
or $7 + 9i$, which have a ‘real part’ and an ‘imaginary part’, are now called complex
numbers, an expression that was first used by Carl Friedrich Gauss (1777-1855) in
1831. These complex numbers extend the real numbers, that is, they contain the real
ones as particular cases. Indeed, the real numbers prove to be exactly those complex
numbers whose imaginary part is 0. In this way, for example, the real number 2 can
be seen as the complex number $2 + 0i$.

Thus, thanks to the development of mathematics, complex numbers evolved 
from an initial stage in which they were considered mere ‘useful fictions’ (doomed 
to ‘disappear’ before the end of the calculation), into a family of numbers contain- 
ing the real ones as particular cases. 

To continue with the explanation of Euler’s formula, it is important to note that 
an essential property of complex numbers is that they can be viewed graphically as 
points on the plane. 

We already know that each point on the plane is described using two coordinates.
The complex number $a + bi$ can be identified with the point of coordinates (a,b).
The number $5 + 2i$, for example, is identified with the point of coordinates (5,2). The
figure below shows the graphical representation of the complex numbers $3 - i$ and
$2 + 0i$.


Graphic representation of the complex numbers $3 - i$ and $2 + 0i$.


Let us now consider the circumference that is centred on the point (0,0) and whose
radius measures 1, as can be seen in the following figure. This circumference is
obviously made up of points on the plane, but as we have said that these points
represent complex numbers, it is also correct to imagine that the circumference
is formed by complex numbers. For example, the complex numbers $i = 0 + 1i$,
$1 = 1 + 0i$, $-i = 0 - 1i$ and $-1 = -1 + 0i$ form part of it.
In the image, we see the circumference centred on (0,0) and of radius 1;
also shown are some of the complex numbers that form it. 


Again it was Euler who, in 1748, showed that if t is a real number (whether 1, −3,
25, π or any other), then $e^{it}$ is always one of the complex numbers that are part
of that circumference. Imagine that a point initially located at position (1,0) begins
to move along the circumference in an anti-clockwise direction. Euler showed that
when that point has travelled a distance equal to t (we assume that t is positive) then
it will have arrived exactly at the point corresponding to the complex number $e^{it}$
(see the circle on the left in the following figure). If t is negative, the conclusion is
similar, only that the movement is clockwise, as can be seen in the graph:


In the figure, we see a point that runs over the length of the circumference 
equal to 2, either clockwise or anti-clockwise. 
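EULER’S FORMULA AND SERIES

The formula $e^{i\pi} = -1$ can be interpreted by working with series. For this we remember that:

$e^{x} = \frac{x^0}{0!} + \frac{x^1}{1!} + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \frac{x^5}{5!} + \frac{x^6}{6!} + \frac{x^7}{7!} + \cdots$

If in this equation we set x = iπ, we conclude that:

$e^{i\pi} = \frac{(i\pi)^0}{0!} + \frac{(i\pi)^1}{1!} + \frac{(i\pi)^2}{2!} + \frac{(i\pi)^3}{3!} + \frac{(i\pi)^4}{4!} + \frac{(i\pi)^5}{5!} + \frac{(i\pi)^6}{6!} + \frac{(i\pi)^7}{7!} + \cdots$

On the other hand, from the fact that $i^2 = -1$ it can be deduced that $i^3 = -i$, $i^4 = 1$, $i^5 = i$, $i^6 = -1$ and so on. From this it is concluded that:

$e^{i\pi} = \left(1 - \frac{\pi^2}{2!} + \frac{\pi^4}{4!} - \frac{\pi^6}{6!} + \cdots\right) + i\left(\pi - \frac{\pi^3}{3!} + \frac{\pi^5}{5!} - \frac{\pi^7}{7!} + \cdots\right)$

Finally, it can be proved that $1 - \frac{\pi^2}{2!} + \frac{\pi^4}{4!} - \frac{\pi^6}{6!} + \cdots = -1$ and that $\pi - \frac{\pi^3}{3!} + \frac{\pi^5}{5!} - \frac{\pi^7}{7!} + \cdots = 0$,
from which it follows that $e^{i\pi} = -1$.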
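
The convergence of this series can also be checked numerically. The following is only an illustrative sketch, not part of the original argument: the function name `exp_series` and the numbers of terms tried are assumptions chosen for the example.

```python
import math

def exp_series(z, terms):
    """Partial sum of z^n / n! for n = 0 .. terms - 1 (z may be complex)."""
    total = 0
    term = 1  # z^0 / 0!
    for n in range(terms):
        total += term
        term = term * z / (n + 1)  # next term: z^(n+1) / (n+1)!
    return total

# As more addends are included, the partial sums approach -1 + 0i.
for terms in (4, 8, 16, 32):
    print(terms, exp_series(1j * math.pi, terms))
```

The real part of each partial sum collects the even powers of π and the imaginary part collects the odd powers, exactly as in the two bracketed series above.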


Consequently, if t = 0, the distance travelled by the point is 0, which means that
the point has not yet left the initial position (1,0), so that $e^{i \cdot 0} = 1 + 0i = 1$, which is
totally consistent with the fact that $e^{i \cdot 0} = e^{0} = 1$ (any number raised to the power 0
equals 1).

What happens if the moving point travels the entire circumference? A first 
answer, of course, is that it will return to the initial position (1,0), which corresponds 
to the complex number 1+ 0i=1. 

On the other hand, however, the total length of a circumference of radius r is
2πr, and given that in this particular case r = 1, the total length travelled by the
moving point will have been t = 2π. Consequently, $e^{2\pi i} = 1$.

If the point always moves in the anti-clockwise direction and travels only a
quarter of the circumference, it will cover a distance of $\frac{\pi}{2}$ and arrive at the point
(0,1), which corresponds to the complex number i, as seen in figure (b) on the
previous page. That is, $e^{i\pi/2} = i$.

Finally, what happens if the point travels half the circumference? In that case, the
distance travelled will be equal to π and the final position, as seen in the following
figure, will be the point (−1,0), which corresponds to the complex number
$-1 + 0i = -1$.
If the point travels a length equal to π, it reaches the position
(−1,0), which corresponds to the complex number −1.
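
The positions described above can be reproduced with a short numerical sketch. It assumes Python’s standard `cmath` module for the complex exponential; the helper name `point_on_circle` is hypothetical and introduced only for this illustration.

```python
import cmath
import math

def point_on_circle(t):
    """Coordinates (x, y) of the complex number e^(it)."""
    z = cmath.exp(1j * t)
    return z.real, z.imag

# Distances travelled from (1, 0); a negative t means clockwise movement.
samples = [("0", 0.0), ("pi/2", math.pi / 2), ("pi", math.pi),
           ("2*pi", 2 * math.pi), ("-pi/2", -math.pi / 2)]
for label, t in samples:
    x, y = point_on_circle(t)
    print(f"t = {label:>5}: ({x:+.3f}, {y:+.3f}), distance from origin = {math.hypot(x, y):.3f}")
```

Every sampled point lies at distance 1 from the origin, and t = π lands (up to rounding) on (−1, 0).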


A MATHEMATICAL PHOTOGRAPHER

In 1864 the American mathematician Benjamin Peirce (1809-1880) was photographed in front of a blackboard on which he had written the formula $i^{-i} = \sqrt{e^{\pi}}$, which results from the equation $e^{i\pi/2} = i$ shown in the text. Referring to the formula $i^{-i} = \sqrt{e^{\pi}}$, Peirce told his students: “This is absolutely paradoxical and we do not know what it means, but we have proved it and, therefore, we know it to be true”.

To carry out the deduction, we raise both sides of the equation to the power i and we deduce that

$\left(e^{i\pi/2}\right)^{i} = i^{i}$.

Finally, we apply some of the exponentiation properties and the fact that $i^2 = -1$, and we can conclude that:

$i^{i} = e^{i^2\pi/2} = e^{-\pi/2}$, and therefore $i^{-i} = e^{\pi/2} = \left(e^{\pi}\right)^{1/2} = \sqrt{e^{\pi}}$.

(In the development we have used $(a^b)^c = a^{bc}$ and the fact that raising to the power $\frac{1}{2}$ is equivalent to taking the square root.)
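
Peirce’s formula can be checked numerically. This is a hedged sketch rather than a proof: it assumes the principal value of the complex logarithm (as returned by Python’s `cmath.log`), and the variable names `lhs` and `rhs` are chosen only for this example.

```python
import cmath
import math

# i^(-i), computed as exp(-i * log(i)) with the principal logarithm log(i) = i*pi/2.
lhs = cmath.exp(-1j * cmath.log(1j))

# Square root of e^pi, a purely real computation.
rhs = math.sqrt(math.exp(math.pi))

print(lhs)  # a complex number whose imaginary part is (numerically) zero
print(rhs)
```

Both values agree to within rounding error, which illustrates how a power involving only imaginary numbers can turn out to be real.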
Therefore, $e^{i\pi} = -1$ or, in other words, $e^{i\pi} + 1 = 0$. In this manner, we have
demonstrated Euler’s famous formula, ‘the most beautiful in the history of
mathematics’.


Bacteria and money: the medium limits growth 


Let us now analyse two other formulae in which the number e is involved. To
begin with, let us ask ourselves: what do bacteria and money have in common?
The answer is that, if conditions are adequate, both reproduce following the same 
mathematical law. 

To understand this idea, let’s return to the example we studied in the previous
chapter, where a capital C is deposited in the bank at an annual compound interest
rate of 20% with continuous capitalisation. We know that after one year, the starting
capital C will have been multiplied by $e^{0.2}$, meaning that the accumulated capital
after one year will be $Ce^{0.2}$. So what will be the accumulated capital after two years?
In this case, we would have to multiply again by $e^{0.2}$ and consequently the capital
will be $Ce^{0.2}e^{0.2} = C(e^{0.2})^2$. Therefore:


Capital after 1 year = $Ce^{0.2}$.
Capital after 2 years = $Ce^{0.2}e^{0.2} = C(e^{0.2})^2 = Ce^{0.2 \cdot 2}$.
Capital after 3 years = $Ce^{0.2}e^{0.2}e^{0.2} = C(e^{0.2})^3 = Ce^{0.2 \cdot 3}$.


(In the calculations, we have used $(a^b)^c = a^{bc}$.)

Generalising these calculations, we conclude that the accumulated capital after
t years is $Ce^{0.2t}$. It is interesting to note that the same formula also applies to
non-whole numbers of years. Thus, for example, the accumulated capital after six months
(half a year) will be

$Ce^{0.2 \cdot \frac{1}{2}}$

and the capital accumulated after four months, or a third of a year, will be

$Ce^{0.2 \cdot \frac{1}{3}}$.

According to the terminology introduced in the previous chapter, the
cumulative capital formula $Ce^{0.2t}$ defines a function which, as we will recall,
is any rule that clearly and precisely assigns to each number another number. In
this case, the function we are considering assigns to each number t, which
represents the number (whole or not) of years elapsed, the capital accumulated
up to that moment. When, as in this case, the assignment is defined by a formula of the
type $Ce^{kt}$, where C and k are constants, the function is called exponential.

In everyday language, the term ‘exponential growth’ is often used to refer to
extremely rapid growth. Indeed, when k is positive (i.e. when the growth rate
is positive), the exponential function grows at a high rate. Imagine that the
starting capital is 10 pounds. The following table shows the values reached by the
accumulated capital $10e^{0.2t}$ as t grows.
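
Values of this kind can be computed with a few lines of code. The sketch below is illustrative only: the function name `capital` and the particular years sampled are assumptions, while the starting capital of 10 pounds and the 20% rate come from the text.

```python
import math

def capital(initial, rate, years):
    """Accumulated capital C * e^(k*t) under continuous capitalisation."""
    return initial * math.exp(rate * years)

# Starting capital of 10 pounds at an annual rate of 20% (k = 0.2).
for t in (0, 0.5, 1, 5, 10, 20, 30):
    print(f"after {t:>4} years: {capital(10, 0.2, t):10.2f} pounds")
```

The same function handles non-whole years, such as the six-month case (t = 0.5) mentioned earlier.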


After 30 years, the initial capital has multiplied by more than 400 times. It is true 
that it may seem a very long time, but let us now compare this growth rate with 
that of bacterial populations. When living space and food are virtually unlimited, 
a bacterial population also grows — like money — at a constant rate. Therefore, as 
we said at the beginning of the section, we could describe the number of bacteria 
by the same function as capital subject to compound interest. This means that if a 
certain population of bacteria increases its number at a constant rate of, say, 20%
per hour, then after t hours, there will be $Ce^{0.2t}$ bacteria, where C represents the
initial amount.

Suppose that a bacterium has a mass of one billionth of a gram, so that an initial
population of a few tens of bacteria has a mass of a similar order. The speed at which
the exponential function grows then has startling consequences. In less than
20 days, that small initial population of bacteria will exceed the total mass of planet
Earth and a month later, its mass will be greater than that of the Virgo supercluster,
a huge cluster of galaxies that includes, among others, the Local Group, the group
of galaxies of which our own Milky Way is a member.

Of course, fortunately for us, neither the living space nor the food source 
available to the bacteria are unlimited, so they never reproduce to the point of 
crushing us with their weight. As the population increases, the food becomes 
scarce — limiting and slowing down the growth of the bacteria numbers. Although 
the population initially increases at a constant rate of 20% per hour, the medium 
in which the bacteria live is capable of sustaining a maximum population of M 
individuals. Under this hypothesis, the formula that calculates the amount of 


bacteria after t hours also involves the number e. Specifically: 


Quantity of bacteria after t hours = $\dfrac{CMe^{0.2t}}{M + C\left(e^{0.2t} - 1\right)}$


So, in this formula, M is the maximum population that the environment can 
maintain and C is the initial amount of bacteria. Known as the logistic function, 
this formula is used in biology to study populations of different species. It can 
be shown mathematically that, as time passes, the bacterial population approaches
the value M. Life spreads out into all available spaces; if the space is unlimited,
it grows without limit, but if the maximum value it can reach is M, then M is its
population ceiling.
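
The behaviour of the logistic function can be explored numerically. The following is a minimal sketch: the function name `logistic` and the sample values C = 50 and M = 1,000,000 are assumptions made for illustration, while the 20% hourly rate is the one used in the text.

```python
import math

def logistic(c, m, rate, t):
    """Population after time t: C*M*e^(kt) / (M + C*(e^(kt) - 1))."""
    growth = math.exp(rate * t)
    return c * m * growth / (m + c * (growth - 1))

C, M = 50, 1_000_000  # hypothetical initial population and ceiling
for hours in (0, 10, 25, 50, 75, 100, 200):
    print(f"t = {hours:>3} h: {logistic(C, M, 0.2, hours):12.1f}")
```

At t = 0 the formula returns C, at first the values grow almost like $Ce^{0.2t}$, and for large t they level off just below M.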


Carbon-14 and exponential decay 


We have already seen examples of exponentially growing quantities; we will now
look at a quantity that decreases following an exponential function. The example
we are referring to is that of radioactive decay. Atoms of any radioactive element
progressively disintegrate into atoms of a different substance, a true transmutation
like the one sought by medieval alchemists. For example, radium is transformed
into radon, a radioactive gas present in the lowest layers of the atmosphere due
to the decomposition of minute amounts of radium in the soil. In turn, radon is
transformed into polonium, which is also radioactive, and this in turn becomes lead, a stable
element that does not disintegrate.


Radium → Radon → Polonium → Lead
Whilst in the money example the capital increases at a constant rate, the mass of
any radioactive substance decreases at a constant rate. Consequently, if we have an
initial amount C of a radioactive substance S, the amount present after a time
interval t has passed is calculated by the formula $Ce^{-kt}$, where k is a constant
that depends on each substance. (The minus sign indicates that the rate of change
is negative, i.e. the quantity decreases rather than increases.)

A characteristic of radioactive decay, and of any other exponential decay, is that 
the time span that an initial mass C takes to reduce by half is always the same. That 
lapse, which is called the half-life of the element, is independent of the initial mass 
and only depends on the substance in question. In the case of radium, for example, 
its half-life is 1,600 years. This means that any initial amount of radium after 1,600 
years will have been halved, and the rest of the initial radium will have become 
other substances, mostly lead. 

Mathematically, knowing the half-life of a radioactive element allows us to
deduce the value of k in the formula indicated above. In fact, let’s suppose that:

The amount of radium present after t years = $Ce^{-kt}$.

We know that when t = 1,600 the quantity present is $\frac{C}{2}$, as the initial mass C
has been halved. Mathematically, we put it this way:

$\frac{C}{2} = Ce^{-k \cdot 1{,}600}$,

from which we can deduce that:

$\frac{1}{2} = e^{-k \cdot 1{,}600}$.

We reiterate that, as we saw in the first chapter, if R, S and T are real numbers
then $R = S^T$ is equivalent to saying that $\log_S(R) = T$ (which reads ‘the logarithm to
base S of R is equal to T’). In the previous equation, therefore,
$R = \frac{1}{2}$, $S = e$ and $T = -k \cdot 1{,}600$.

So we deduce that

$\log_e\left(\frac{1}{2}\right) = -k \cdot 1{,}600$,

and as the base e logarithm is the so-called ‘natural logarithm’, we write:

$\ln\left(\frac{1}{2}\right) = -k \cdot 1{,}600$.

Considering that $\ln\left(\frac{1}{2}\right)$ is approximately −0.6931 (rounding to four decimal places) gives:

$-0.6931 = -k \cdot 1{,}600$,

from which we conclude that k is approximately 0.0004332. The formula for
calculating the amount of radium present after t years is:

Amount of radium present after t years = $Ce^{-0.0004332t}$.
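
The same deduction can be carried out numerically for any half-life. The sketch below is illustrative: the function names `decay_constant` and `remaining` are assumptions introduced for the example, and it uses the fact that −ln(1/2) = ln(2).

```python
import math

def decay_constant(half_life):
    """Value of k such that e^(-k * half_life) = 1/2, i.e. k = ln(2) / half_life."""
    return math.log(2) / half_life

def remaining(initial, half_life, t):
    """Amount C * e^(-k*t) still present after time t."""
    return initial * math.exp(-decay_constant(half_life) * t)

k_radium = decay_constant(1600)   # half-life of radium in years
print(f"k = {k_radium:.7f}")
for years in (0, 800, 1600, 3200, 4800):
    print(f"after {years:>4} years: {remaining(1.0, 1600, years):.4f} of the initial mass")
```

After one half-life the fraction remaining is one half, after two half-lives one quarter, and so on, whatever the initial mass.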


This progressive radioactive decay is very useful in fields such as archaeology or 
palaeontology, specifically through what is known as carbon-14 dating, also called 
C-14 or radiocarbon dating. This technique was developed in the late 1940s by the chemist Willard
Libby, who in 1960 won the Nobel Prize in Chemistry for the
development of the carbon-14 method for age analysis.

Carbon-14 is a radioactive variety of carbon that is present in small amounts in 
the atmosphere and is absorbed by living beings through respiration. Therefore, all 
species of plants and animals contain, in their bodies, whilst alive, the same constant 
amount of carbon-14. 

But when breathing stops, the body stops absorbing carbon-14 and, as a con- 
sequence of radioactive decay, the amount of the substance present in the different 
parts of the body begins to decrease. As the half-life of carbon-14 is 5,730 years, 
the amount of this substance present in a fossil allows us to determine with enough 
precision the time elapsed since the death of the organism in question. How is this 


achieved? Let’s start with this formula: 


Quantity of radiocarbon present t years after death = $Ce^{-kt}$.


As the half-life is 5,730 years, a reasoning similar to that used for radium allows
us to find the value of k, which for carbon-14 is:

$-0.6931 = -k \cdot 5{,}730$
$k = 0.000121$.
So:


Amount of radiocarbon present t years after death = $Ce^{-0.000121t}$.


Let’s suppose that the fossilised bone of a prehistoric animal has been found and
that the initial amount of radiocarbon (which, as we said, is the same in all living
beings) has been reduced to 5%. If t is the time elapsed from the death of the animal
up until the present time:

$0.05C = Ce^{-0.000121t}$
$0.05 = e^{-0.000121t}$
$\ln(0.05) = -0.000121t$

And given that ln(0.05) is approximately −3, we can conclude that:

$-3 = -0.000121t$
$t \approx 24{,}793$ years.


Therefore, we will know that the animal died a little less than 25,000 years ago.
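
The dating calculation can be written as a small function. This is a sketch under stated assumptions: the name `age_from_fraction` is hypothetical, and the code uses unrounded values of the logarithms, so its result differs slightly from the figure obtained in the text with the rounded values −3 and 0.000121.

```python
import math

def age_from_fraction(fraction, half_life=5730):
    """Years elapsed when a given fraction of the initial carbon-14 remains.

    Solves fraction = e^(-k*t) for t, with k = ln(2) / half_life.
    """
    k = math.log(2) / half_life
    return -math.log(fraction) / k

print(f"{age_from_fraction(0.05):.0f} years")  # sample with 5% of its radiocarbon left
print(f"{age_from_fraction(0.5):.0f} years")   # one half-life
```

Either way, the conclusion is the same: an animal whose bone retains 5% of its radiocarbon died a little less than 25,000 years ago.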

In conclusion, in this chapter, we have seen that in addition to its use in the
calculation of bank interest, the number e also appears in the description of circular
movement, the calculation of population growth, the mathematical description of
radioactive decay and the dating of fossils by radiocarbon. Next, we will see how
the number e also features in the calculation of probabilities.
The number e
and probabilities 


Probability and statistics are two branches of mathematics ubiquitous in everyday 
life. They are very useful to calculate such things as how easy or difficult it is to 
win the lottery or how much the prices of mobile phones have increased by in 
the last year. 

Below, we will see three situations in which probability and statistics are linked
to the number e. In the first of these situations, there is a very specific problem
that we will call the ‘hat problem’. The other two, on the other hand, are situations
with a much more general application, which include the quantity of radioactive
particles emitted during a specific period of time, the measurement errors that occur
in astronomical observations and even the weight of products usually bought in
a supermarket.


The hat problem 


Imagine that three men, whose initials are A, B and C, are going to eat at the same 
restaurant and, when they enter, they hang their hats on the same coat rack. Let’s 
also imagine that the three hats are very similar. So much so that when the men 
leave, none of them can clearly pick out their own hat. After much discussion and 
comparison, they decide that each person randomly chooses one of the hats. First 
A chooses, then B and finally C. What is the probability that no one will wear their 
own hat? 

To work this out, we will divide the number of distributions in which
nobody takes their own hat by the total number of ways in which the three hats
can be distributed:

Probability that no one will wear their own hat = (Number of distributions in which nobody takes their own hat) / (Total number of possible distributions)
For this, we will establish, on the one hand, in how many different ways the
three hats can be distributed, and on the other hand, we will calculate in how many
of these distributions nobody receives the hat that belongs to him.

In how many different ways can the three hats be distributed? One possibility is 
that, due to a lucky chance, each man takes their own. Another scenario would be 
that A wears B’s, B takes A’s and C takes his own. Or that A wears B’s, B wears C’s 
and C wears A’s. And so on. 

To facilitate the calculation, we will number the hats. We will call A’s hat 
1, B’s hat 2 and C’s hat 3. Thus, each distribution of the hats will be indicated by a 
sequence of three numbers. Firstly, we will note the hat that A took, then the hat 
that B took and thirdly the hat that C took. In the event that each one takes his 
own hat, we will indicate 123. If A takes B’s hat, B takes A’s hat and C takes his own hat, the
sequence will be 213, and so on. Consequently, there are as many ways of distributing
the hats as there are ways of permuting the numbers 1, 2 and 3. In other words, there are 3! = 6
different ways to distribute the three hats.


In the image, A is wearing hat 3, B hat 1 and C hat 2. We indicate 
this distribution of hats as 312. 


This reasoning, of course, can be extended to any number of people. In this way, 
if there had been four men, there would be 4! = 24 possible distributions of hats. If 
there had been five, there would be 5! = 120 distributions... And so on. 

But now let us return to the case of the three men and look for how many
distributions there are in which no one takes his own hat. A simple way to find out
is to check the six permutations of the numbers 1, 2 and 3 one by one and only keep
those that meet the indicated condition:
123: Discarded because A, B and C each take their own hat.
132: Discarded because A takes his own hat.
213: Discarded because C takes his own hat.
231: Meets the condition.
312: Meets the condition.
321: Discarded because B takes his own hat.


Of the six hat distributions, we are left with those in which 
nobody takes their own hat. 


The conclusion is that there are exactly two distributions in which no one takes
their own hat: 231 and 312. Therefore, in this case, the probability that no one finds
their hat is $\frac{2}{6} = \frac{1}{3} = 0.333...$
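
The one-by-one check of the permutations can be automated. The sketch below is only an illustration of that brute-force idea; the function name `count_no_own_hat` is an assumption, and people and hats are numbered from 0 instead of 1 for convenience.

```python
from itertools import permutations
from math import factorial

def count_no_own_hat(n):
    """Number of ways to hand out n hats so that nobody receives their own.

    p[i] is the hat taken by person i; person i owns hat i.
    """
    people = range(n)
    return sum(1 for p in permutations(people)
               if all(p[i] != i for i in people))

for n in (3, 4):
    favourable = count_no_own_hat(n)
    total = factorial(n)
    print(f"{n} people: {favourable} of {total} distributions, probability {favourable / total:.4f}")
```

This approach inspects all n! permutations, so it is only practical for small numbers of people.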


The n subfactorial 


We know that each distribution of three hats in which no one takes their own is
indicated by a permutation of the numbers 1, 2, 3 in which none of them occupies its
original place, for example, 312. The total number of these permutations is indicated
as !3. Similarly, we have !1 = 0 (because if there is only one hat, it is impossible
that its owner does not take it), !2 = 1 and !3 = 2.

The symbol !n is read as the ‘n subfactorial’ and fulfils the following relationship:

$!n = (n-1)\left(!(n-1) + !(n-2)\right)$.

In this way, for example:

$!4 = (4-1)\left(!(4-1) + !(4-2)\right) = 3(!3 + !2) = 3(2 + 1) = 9$.

This relationship makes it consistent to define !0 as 1. The deduction is as follows:
1 = !2 = 1(!1 + !0) = 1(0 + !0), so !0 = 1.

Curiously, the factorial also fulfils the same relationship. In fact, it can be verified
that:

$n! = (n-1)\left((n-1)! + (n-2)!\right)$.
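
The recurrence lends itself to a direct computation. The following is a minimal sketch, with the function name `subfactorial` chosen for this example; it starts from the values !0 = 1 and !1 = 0 given above.

```python
from math import factorial

def subfactorial(n):
    """!n computed with the relationship !n = (n - 1) * (!(n - 1) + !(n - 2))."""
    if n == 0:
        return 1
    if n == 1:
        return 0
    two_back, one_back = 1, 0          # !0 and !1
    for k in range(2, n + 1):
        two_back, one_back = one_back, (k - 1) * (one_back + two_back)
    return one_back

print([subfactorial(n) for n in range(7)])

# The factorial satisfies the same relationship for n >= 2.
for n in range(2, 8):
    assert factorial(n) == (n - 1) * (factorial(n - 1) + factorial(n - 2))
```

Unlike checking permutations one by one, this needs only n − 1 multiplications to obtain !n.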
So, what is the probability if there are four people? We said above that the total
number of distributions is, in this case, 4! = 24. On the other hand, if we inspect the
permutations one by one and discard those in which someone takes their own hat,
it turns out that there are only nine distributions in which no one takes it: 2143,
2341, 2413, 3142, 3412, 3421, 4123, 4312 and 4321 (following the above pattern,
the hats of A, B, C and D have been numbered as 1, 2, 3 and 4, respectively). The
probability for the case of four men is, therefore, $\frac{9}{24} = 0.375$.


At this point, two questions arise. One is if the only way to calculate the cases in 
which no one is wearing their own hat is to discard, one by one, those not meeting 


that condition. And the other is what does the number e have to do with all this? 


The inclusion—exclusion principle and successive approximations 


Let’s go back to the situation where there are four people, A, B, C and D, and their
four hats 1, 2, 3 and 4, and give a ‘reasoned’ answer to the question of how many
distributions of hats there are (or, equivalently, how many permutations of the four numbers) in
which no one takes their own. The process we are going to apply is known as the
inclusion-exclusion principle, which was first developed by the French mathematician
Abraham de Moivre. As we shall see, this method consists of finding the answer by
successive approximations. To make things easier, we shall divide the reasoning into
four steps.


Step 1: Although we know that there are 24 ways to distribute the hats, we 
are only interested in those instances in which no one takes their own. For this, we 
subtract from the total those distributions in which one of the four persons takes 


their own hat. 


Distributions in which nobody takes their own hat = 


Total distributions — Distributions in which someone takes their own hat. 


In how many distributions does A take their own hat? We note that these distributions
correspond to permutations beginning with 1 or, in other words, of
the form 1xyz, where xyz is a permutation of the numbers 2, 3, 4. To write the
permutations that begin with 1, we ‘fix’ the 1 on the left and we permute the other
three numbers:
1234 1243 1324 1342 1423 1432


So there are 3! = 6 distributions in which A takes their hat.
To obtain the permutations in which B takes their own
hat, we fix the 2 in the second place (thus indicating that B takes hat 2) and we
permute the other three numbers:

1234 1243 3214 3241 4213 4231

And we see that there are also six permutations in which B takes their own hat.
This is also the case for C and for D:


1234 1432 2134 2431 4132 4231 
1234 1324 2134 2314 3124 3214 


It should not worry us that, for example, the A and B lists contain repeated
permutations (1234 and 1243 appear in both), because the miscalculations that may
arise from this will be compensated for in the following steps. As we have said, we
will proceed by ‘successive approximations’. In this first step, we are only working
with individual people and we say, simply, that each of them generates six permutations
that must be subtracted or excluded from the total of 24. Therefore, we will
subtract 4 × 6 = 24 permutations and the first approximation to the result will give
us 24 − 24 = 0.


24 permutations — (6 where A takes their hat) — (6 where B takes their hat) — (6 
where C takes their hat) — (6 where D takes their hat) =0 


Step 2: In the first step, we counted the permutations in which one of the people
takes their own hat, without taking into account that some of them belong to
more than one group. For example, permutations 1234 and 1243 are in the group
in which A takes their hat and also in the group where B takes theirs. So, these
permutations have been subtracted twice: once in the group of six that corresponds
to A and a second time in the group of six that corresponds to B. The sequences 1234
and 1243 are the permutations whereby A and B simultaneously take their hats and
are obtained by fixing the numbers 1 and 2 and permuting the remaining two. This
idea allows us to deduce what the other repeated permutations are:
They take their hat: A and B | A and C | A and D | B and C | B and D | C and D
Permutations: 1234, 1243 | 1234, 1432 | 1234, 1324 | 1234, 4231 | 1234, 3214 | 1234, 2134


There are 12 permutations in these lists, each of which has been subtracted twice when it only
had to be subtracted once. To compensate for this ‘mistake’, we will add (or include)
those permutations that were previously unnecessarily subtracted. We will, therefore,
have a new approximation to the answer: 24 − 24 + 12 = 12.


Step 3: In step 1, we considered the people individually and have subtracted 
the permutations in which each of them have taken their own hat, without the 
repetitions concerning us. In step 2, people have been grouped into pairs and we 
have added the permutations in which the members of each pair simultaneously 
take their hats. 

Following this pattern, in step 3, we shall consider groups of three people and 
subtract those permutations in which all three take their hats simultaneously. But 


now we have four groups of three people: 


They take their hat: A, B and C | A, B and D | A, C and D | B, C and D
Permutation: 1234 | 1234 | 1234 | 1234


As we see, for each group of three, there is only one permutation in which all
take their own. Each one is obtained by fixing three figures and although the four
permutations are actually the same, 1234, the inclusion-exclusion principle does
not take that detail into account. In conclusion, from the total we have reached up to
this point, we must subtract 4. The new approximation is: 24 − 24 + 12 − 4 = 8.


Step 4: Let us summarise. In step 1, individual people were considered and
we subtracted a certain number of permutations from the total of 24. In step 2, we
worked with pairs of people and we added a quantity. In step 3, the groups were of
three people and an amount was subtracted. Finally, in step 4, we shall take the only
group of four people that exists (A, B, C and D) and add the only permutation in
which all four are wearing their own hat (1234, in which the four numbers are
fixed). Consequently, the final result is 24 − 24 + 12 − 4 + 1 = 9.
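
The four steps can be condensed into a short computation. This is an illustrative sketch, not the book’s own formulation: it assumes that step k contributes the number of groups of k people (a binomial coefficient) multiplied by the (n − k)! ways of permuting the remaining hats, with alternating signs, and the function name `inclusion_exclusion_terms` is hypothetical.

```python
from math import comb, factorial

def inclusion_exclusion_terms(n):
    """Signed amounts for steps 0..n: (-1)^k * (groups of k people) * (n - k)!."""
    return [(-1) ** k * comb(n, k) * factorial(n - k) for k in range(n + 1)]

terms = inclusion_exclusion_terms(4)
print(terms)       # the total, then the amounts subtracted and added at each step
print(sum(terms))  # distributions in which nobody takes their own hat
```

For four people the signed amounts are exactly those found in the text (24, −24, +12, −4, +1); changing the argument to 5 gives the five-step version of the same reasoning.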
If there were five people we would proceed in a similar way, albeit in
five steps. In the first, from the total of 5! = 120 permutations we would subtract
those that correspond to individual people (in which one number is fixed).
In the second step, we would add the permutations in which two numbers are fixed,
and so on.


Now, the number of distributions in which none of the four people
takes their own hat is, as indicated by the inclusion-exclusion reasoning, 24 − 24 + 12 − 4
+ 1. So what then is the probability that no one will take it? The answer is:

$\frac{24 - 24 + 12 - 4 + 1}{24} = 1 - 1 + \frac{1}{2} - \frac{1}{6} + \frac{1}{24} = \frac{1}{0!} - \frac{1}{1!} + \frac{1}{2!} - \frac{1}{3!} + \frac{1}{4!}$


For the five-person scenario, the probability would be written thus:


$\frac{1}{0!} - \frac{1}{1!} + \frac{1}{2!} - \frac{1}{3!} + \frac{1}{4!} - \frac{1}{5!}$


And the same for six, seven or any other larger amount. The same expression even
applies in the case of three people, as $\frac{1}{0!} - \frac{1}{1!} + \frac{1}{2!} - \frac{1}{3!}$ gives $\frac{2}{6}$, which is the
probability we indicated above, and it also applies in the case of two people and that
of a single individual (in the case of two people, the probability is $\frac{1}{0!} - \frac{1}{1!} + \frac{1}{2!} = \frac{1}{2}$;
and when there is only one, the probability is $\frac{1}{0!} - \frac{1}{1!} = 0$, as if there is only one person and
only one hat, it is impossible not to take their own).


But, as we have already seen in the previous chapter, as addends are added, the
result of the calculation $\frac{1}{0!} - \frac{1}{1!} + \frac{1}{2!} - \frac{1}{3!} + \frac{1}{4!} - \dots$ gets closer and closer to $\frac{1}{e}$.
In other words, as the number of people grows, the likelihood that no one will wear
their own hat will get closer and closer to $\frac{1}{e} \approx 0.3679$. For example, for five people
the probability is $\frac{1}{0!} - \frac{1}{1!} + \frac{1}{2!} - \frac{1}{3!} + \frac{1}{4!} - \frac{1}{5!} \approx 0.3667$.

This conclusion is extremely surprising for two reasons. Firstly, because of the
appearance of the number e in a problem with which, a priori, it did not seem to
have any link. Secondly, because of what follows from the following reflection. If,
in the Alpha restaurant, there are 20 men who hand in their hats and in the Beta
restaurant, which is much larger, there are a million men, in which of the two
restaurants is it more likely that no one will wear their own hat?

Intuition tells us that it is much more likely, almost certain, that in the Beta
restaurant, no one will take their own. This is because the probability that, for
example, the first man chooses his own hat is a millionth, whilst in the Alpha
restaurant, it is $\frac{1}{20}$. Additionally, in the Beta restaurant, the last man will only
be able to take his own hat if it was not chosen by any of the other 999,999 people.
However, contrary to intuition and according to the result set out above, in
both the Alpha and Beta restaurants, the probability that nobody will wear their
own hat is practically the same, 36.79%, and it would be the same whether the
sample is 50 people or a trillion.
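
A quick numerical check makes the point concrete. The sketch below (an illustration, with a hypothetical function name) adds up the alternating sum 1/0! - 1/1! + 1/2! - ... for groups of different sizes and prints the results next to 1/e. The sum is built term by term so that no huge factorial ever has to be computed.

```python
from math import exp

def no_match_probability(n):
    """Partial sum 1/0! - 1/1! + 1/2! - ... up to the term with n!."""
    total, term = 0.0, 1.0
    for k in range(n + 1):
        if k > 0:
            term = -term / k      # turns the previous term into (-1)^k / k!
        total += term
        if term == 0.0:           # later terms are too small to change the sum
            break
    return total

for n in (3, 4, 5, 20, 1_000_000):
    print(n, round(no_match_probability(n), 6))
print("1/e", round(exp(-1), 6))
```

Already for 20 people the value is indistinguishable, to six decimal places, from the value for a million.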

This result can be used to perform a ‘divination trick’, in which a magician
picks up a baraja española (Spanish deck of cards) and makes four stacks of 12 cards,
one for each of the four suits: oros (golden coins), copas (goblets), bastos (clubs) and
espadas (swords). The magician then turns their back and asks a volunteer to shuffle
the oros and then do the same with the other three piles. Next, the volunteer is asked
to remove the cards from the oros stack one by one as the magician says the numbers
1 to 12 aloud, either in ascending order or in any other order they wish.
Before beginning, the magician will have predicted that, at least once, the number
said aloud will coincide with that of the card that the volunteer pulls out. If at the
end of the oros stack, there is no match, the operation will be repeated with the
stack of copas and so on with the other two, until the prediction is
fulfilled. When the match occurs, the magician will receive and acknowledge the
audience’s applause.

Obviously, the trick can ‘fail’. Perhaps the person who selects the cards from
the four stacks does so without there being any match. But what is the probability
of that happening? The question is equivalent to considering the probability that
in four different restaurants, each with 12 customers, no customer takes their own
hat. In this simile, each ‘restaurant’ is one of the piles of cards, the numbers that
the magician lists are the customers and the cards are the hats. The magician saying
‘six’ at the very moment that the volunteer draws that card is equivalent to a
customer, by chance, choosing their own hat.

If there were only one stack, the probability that the trick would fail would
be approximately 0.3679, but if there are four stacks, the probability is obtained
by multiplying that number by itself four times, i.e. $(0.3679)^4 \approx 0.018$ (rounded
to three decimal places). In other words, there will be a 98.2% chance that the
trick works out, that is, a 98.2% chance that at least one match will occur. Therefore,
the magician will risk performing the trick because it is more than likely
to succeed.

If the magician is daring, they may go a little further and tell the volunteer,
when there is a match in one pile, to abandon it and move on to the next one. In that
case, the magician can make an even better prediction and claim that there will
be a match in at least two of the stacks. The probability of this happening is 85.6%,
which also means a high expectation of success.
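
Both percentages can be reproduced in a few lines. The sketch below assumes, as the text does, that each stack independently fails to produce a match with probability about 1/e, and treats the four stacks like four tosses of a biased coin. The variable and function names are illustrative.

```python
from math import comb, exp

p_fail = exp(-1)        # assumed chance of no match in one stack (about 0.3679)
p_match = 1 - p_fail

def prob_matches(k, stacks=4):
    """Probability that exactly k of the stacks produce a match."""
    return comb(stacks, k) * p_match**k * p_fail ** (stacks - k)

trick_fails = prob_matches(0)
at_least_two = 1 - prob_matches(0) - prob_matches(1)
print(f"no match in any stack:   {trick_fails:.3f}")      # 0.018
print(f"at least one match:      {1 - trick_fails:.3f}")  # 0.982
print(f"matches in at least two: {at_least_two:.3f}")     # 0.856
```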


ABRAHAM DE MOIVRE, A GREAT ADMIRER OF NEWTON


The French mathematician Abraham de Moivre was born in Vitry-le-François, on
26 May 1667, into a Protestant family. He studied mathematics at the Sedan academy,
where his notable ability stood out. However, in 1685, following the religious wars
and the expulsion of the Huguenots by King Louis XIV, De Moivre was forced into
exile in London, from where he did not return.

In England, De Moivre was able to read the first edition of Newton's Principia
and immediately knew it was a masterpiece. For that reason, he wrote some brilliant
related works, but as he was a foreigner, it was difficult to get them published. Finally,
it was Edmond Halley (1656-1742) who paved the way. As a physicist, mathematician
and astronomer, Halley had, among other things, financed the publication of Isaac
Newton's great work. Halley recognised the quality of De Moivre’s work and, thanks to
his influence, made him a member of the Royal Society in London.

This eminent scientist published important work on mathematical analysis, analytical
geometry and probability theory and is remembered for the formula used to easily
calculate the powers of a complex number: the De Moivre formula.

Abraham de Moivre died in London on 27 November 1754.
The Poisson processes


Let’s now look at another example related to the calculation of probabilities
involving the number e. Imagine that on a motorway, there is a toll booth at which, from
time to time, a car stops, as shown in the figure below. And let’s suppose that for a
certain period of time, the vehicles arrive at the booth at a relatively constant rate
of about 12 cars per hour. We refer to an average rate, as the number of cars that will
actually pass over the course of an hour may be more or less than 12.


A car approaches a toll booth. 


How could we find the probability that exactly ten cars will pass through the 
booth over the next hour? 

In this scenario, we study the number of times a particular event can occur over
a specific period of time, assuming that such events (the arrival of a car at the
booth, a priori random or unpredictable) occur at a relatively constant rate. When
these hypotheses are fulfilled, it is said that we are studying a Poisson process, named
after the French mathematician Siméon Denis Poisson, who analysed this type of
situation and published work on it in 1837. In that study, entitled Recherches sur la
probabilité des jugements en matière criminelle et en matière civile (Investigations into
Probability in Trials in Criminal and Civil Matters), Poisson considered judicial errors as events
that occurred at random and calculated the probability of a certain number of such
errors occurring over a determined period of time.

But the mathematical nomenclature here commits another historical injustice,
similar to that of the Napierian logarithms or natural logarithms, when
SIMÉON DENIS POISSON, A LOVER OF MATHEMATICS


Siméon Denis Poisson was born in Pithiviers, France, on 21 June 1781 and studied
mathematics at the École Polytechnique in Paris, where he was fortunate enough to
have as teachers the famous mathematicians Pierre-Simon Laplace (1749-1827) and
Joseph-Louis Lagrange (1736-1813), who quickly noticed the great talent of their
pupil. In 1802, after graduating, Poisson began to work at that same Parisian
school.

Under the supervision of Lagrange and Laplace, Poisson published his first research
work in 1808, in which he analysed the perturbations suffered by planetary movements
due to the gravitational influence of other bodies in the Solar System. After this, he
produced a great number of important works on physics, differential equations and
probabilities. In his later years, he declared that the best thing in his life had been
‘to discover mathematics and to teach mathematics’.

Poisson passed away in Sceaux, near Paris, on 25 April 1840.


their true discoverer was not John Napier but Jobst Bürgi. Similarly, the so-called
Poisson processes were first studied by Abraham de Moivre in 1711,
before Poisson was born, in an article published in Philosophical Transactions
of the Royal Society. He titled his work De Mensura Sortis, seu, de Probabilitate
Eventuum in Ludis a Casu Fortuito Pendentibus, which can be translated as On
the Measurement of Chance, or, On the Probability of Events in Games with Chance
Opportunities.

In some cases, the Poisson processes have been the basis for original studies,
such as that by the Russian mathematician Ladislaus Bortkiewicz (1868-1931),
who, in 1898, studied the number of Prussian soldiers killed by being kicked
by a horse. Usually, this theory is used for more practical purposes, such as
establishing the number of calls a telephone exchange receives over a certain
period, finding out the number of particles emitted by a radioactive substance
or how many taxis pass a specific corner over a certain amount of time. For the
hypotheses to hold, the period considered should not be too long, so that the
arrival rate remains relatively constant.

But let’s return to our toll booth where approximately 12 cars pass through 
per hour. We wanted to find out what the probability is for ten cars passing 
through in an hour. But, let’s first ask another question: what is the probability 
that no cars will pass through over the course of an hour? Obviously, if you 
normally have an average of 12 vehicles per hour, it is very unlikely that none 
will pass through, but however unlikely, it could happen. So let’s calculate the 
exact value of this probability. 

First, we divide the one-hour period into n parts, each one so small as to make
it impossible for two or more cars to reach the booth in that time. For example,
we could take n = 3,600. If we divide the hour into 3,600 parts, each of them
lasts one second, and in a period of one second there is no time for two or
more cars to arrive.

Once this division is made, we can verify that the probability of exactly one car
arriving in one of those periods is $\frac{12}{n}$, and therefore the probability that no car
arrives is $1 - \frac{12}{n}$. (Likewise, if a coin is biased so that the probability of heads is
0.4, the probability of tails is 1 - 0.4 = 0.6.) Finally, to find the probability that a car
will not arrive in any of the n periods of time, we multiply $1 - \frac{12}{n}$ by itself n times,
which means that it will be $\left(1 - \frac{12}{n}\right)^n$.


Now, to get the answer, we must increasingly reduce these periods to turn them
into precise moments of time. In other words, n must tend to infinity. It is very
interesting to note the similarity between the calculation we are making and the
reasoning set out in chapter 1 related to compound interest. On that occasion, we
also divided the period of time in question (in that case, a year) into smaller periods:
semesters, months, or minutes.

Consequently, the probability that no car will reach the toll booth is the
value that the expression $\left(1 - \frac{12}{n}\right)^n$ approaches when n takes
increasing values. And it so happens that, as well as the expression $\left(1 + \frac{1}{n}\right)^n$
THE SUM IS EQUAL TO 1


As discussed in the main text, the probability that exactly k cars arrive at the toll booth
within a period of one hour is $\frac{12^k e^{-12}}{k!}$, provided that 12 is the average number
of cars per hour. Now if we add up the probabilities of all possible quantities of cars,
the probability of zero cars arriving, plus that of one arriving, plus that of two or three,
etc. arriving, the result must be equal to 1. If we check it mathematically, we see that, in
effect, the sum of all probabilities is:


$\frac{12^0 e^{-12}}{0!} + \frac{12^1 e^{-12}}{1!} + \frac{12^2 e^{-12}}{2!} + \dots = e^{-12}\left(\frac{12^0}{0!} + \frac{12^1}{1!} + \frac{12^2}{2!} + \dots\right)$


We recall that $e^x = \frac{x^0}{0!} + \frac{x^1}{1!} + \frac{x^2}{2!} + \dots$
If, in this expression, x = 12, we can deduce that the result of the infinite sum that
appears in parentheses above is $e^{12}$. Therefore, the sum of all probabilities is
$e^{-12} \cdot e^{12} = 1$.


approaching the number e, the expression $\left(1 - \frac{12}{n}\right)^n$ approaches $e^{-12}$, which is
the same as $\frac{1}{e^{12}}$. Therefore, the probability that no car will arrive is $\frac{1}{e^{12}}$,
approximately six millionths. That is, on average, the event may be expected to
happen about six times every million hours. As a million hours is about 114 years,
we can conclude that if the rate of 12 cars per hour remained constant day and night
for more than a century, an hour in which no cars pass would occur
approximately once every 19 years.
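
The limit and the ‘once every 19 years’ estimate are easy to verify numerically. The sketch below evaluates (1 - 12/n)^n for finer and finer divisions of the hour (minutes, seconds, thousandths of a second) and then repeats the arithmetic of the paragraph above. Taking a year as 365 days is an assumption made here for simplicity.

```python
from math import exp

rate = 12  # average number of cars per hour, as in the text

for n in (60, 3_600, 3_600_000):
    print(n, (1 - rate / n) ** n)
print("e^-12 =", exp(-rate))

hours_per_year = 24 * 365                       # assumption: 365-day year
empty_hours_per_million = exp(-rate) * 1_000_000
years_in_million_hours = 1_000_000 / hours_per_year
print(round(empty_hours_per_million, 2))        # about 6 empty hours
print(round(years_in_million_hours))            # 114 years
print(round(years_in_million_hours / empty_hours_per_million))  # 19 years
```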

Now, what is the probability that exactly ten cars will pass? The reasoning in this
case is similar to that shown above: the time is divided into n small parts and the
probability is calculated that a car arrives in exactly ten of them. Once this expression
is found, we look for the value that it approaches when n tends to infinity. The
result is as follows:
The probability of ten cars arriving in 1 hour $= \frac{12^{10} e^{-12}}{10!} = \frac{12^{10}}{10!} \cdot \frac{1}{e^{12}}$.


The probability is approximately 10.48%. Generally, the probability that k cars
arrive within one hour is $\frac{12^k e^{-12}}{k!}$ (if k = 0, we do indeed obtain $e^{-12}$); while for a
period of t hours, whether t is an integer or not, the probability is $\frac{(12t)^k e^{-12t}}{k!}$.
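
The general formula translates directly into a short function. The following sketch (the function and parameter names are chosen here for illustration) evaluates the probability of exactly k cars for the toll booth and checks that the probabilities of all possible counts add up to 1, up to rounding.

```python
from math import exp, factorial

def poisson_probability(k, rate=12.0, hours=1.0):
    """Probability of exactly k arrivals in `hours` at `rate` arrivals per hour."""
    mean = rate * hours
    return mean**k * exp(-mean) / factorial(k)

print(f"{poisson_probability(10):.4f}")   # 0.1048
print(f"{poisson_probability(0):.2e}")    # 6.14e-06
# Adding the probabilities for k = 0, 1, 2, ... gets as close to 1 as we like.
print(sum(poisson_probability(k) for k in range(100)))
```

Setting `hours` to a fraction gives the probability for shorter periods, for example `poisson_probability(0, hours=1/6)` for ten minutes without a car.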


But there is another calculation, associated with this same situation, in which
the number e also intervenes. Suppose a car has just left the toll booth and at that
point, we ask ourselves what is the probability that the next car will take more than
10 minutes to arrive (a period we express as $\frac{1}{6}$ of an hour, as we are measuring the
time in hours). This is equivalent to wondering what the probability is that no cars
will pass through over the next 10 minutes, which is, according to the formula we
have shown above, $\frac{\left(12 \cdot \frac{1}{6}\right)^0 e^{-12 \cdot \frac{1}{6}}}{0!} = e^{-2} = 0.1353$, that is, 13.53%.


But there are other more practical applications. Imagine that we are at a corner
where we know a taxi passes approximately every five minutes ($\frac{1}{5}$ taxi per
minute, on average). We have just missed one and we urgently need to take a
taxi in the next 60 seconds. What is the probability that we will achieve this?

The answer, according to what we have already seen, is that this probability is
$1 - e^{-\frac{1}{5}} = 0.1813$, that is, 18.13%. There is no doubt that, although we are not always
aware of it, the number e is key to the most common and everyday situations.
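
Both waiting-time questions reduce to the same expression: the probability of no arrivals during a given time is e raised to minus (rate × time). A minimal sketch, with an illustrative function name, reproduces the two figures:

```python
from math import exp

def prob_no_arrival(rate, time):
    """Probability that nothing arrives during `time` (same time unit as `rate`)."""
    return exp(-rate * time)

# Toll booth: 12 cars per hour, waiting more than 10 minutes (1/6 of an hour).
no_car_in_10_min = prob_no_arrival(rate=12, time=1 / 6)
# Taxi: 1/5 taxi per minute, at least one taxi within the next minute.
taxi_within_1_min = 1 - prob_no_arrival(rate=1 / 5, time=1)

print(f"{no_car_in_10_min:.4f}")    # 0.1353
print(f"{taxi_within_1_min:.4f}")   # 0.1813
```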


Normal distribution and the bean machine


There is an ingenious device related to probabilities in which the number e appears.
This is the bean machine, also known as a Galton box, which takes its name from
Francis Galton, who first designed it.

The machine in question is a device consisting of a vertically placed box,
through which many small balls are successively dropped (for the described effect
to be visible, the number of balls must be large, at least 300). As they descend, the
balls hit a series of obstacles, and each then falls at random to the right or to the
left. At the end, all the balls accumulate in different receptacles at the bottom of the
box, as shown in the following figure:
Schematic image of a bean machine (Galton box). After hitting obstacles,
the balls accumulate in different receptacles. 


In a bean machine the number of receptacles is always equal to the number of
obstacles that each ball encounters in its fall, plus one. In the case where there are
20 obstacles (the more obstacles there are, the more visible the effect that we are
going to describe), the receptacle at the left-hand end will receive all the balls that,
after colliding, always fall to the left. In the next receptacle will be those that fell
19 times to the left and once to the right, and so on until reaching the receptacle at
the right-hand end, where the balls that always jumped to the right will be stored.
How will the balls be distributed among the receptacles? It is interesting to note
that, although it is impossible to predict which receptacle each of the balls will fall
into, we may instead know the overall ‘pattern’ that will form. This situation is
completely analogous to what happens in polls prior to an election day, because
although it is not possible to know in advance who a specific citizen will vote for,
pollsters can fairly accurately predict what percentage of votes each candidate will
receive overall, provided that surveys are based on a sufficiently representative sample.

Before we figure out how the balls will be distributed in the receptacles, we
will pose a simpler question. If we throw a balanced coin four times, what are we
FRANCIS GALTON, A MULTIDISCIPLINARY RESEARCHER


Francis Galton was born in Sparkbrook, near Birmingham, UK, on 16 February 1822. He
read medicine and mathematics at Cambridge University, graduating in 1844. Galton,
who was fundamentally an explorer and anthropologist, travelled throughout Africa
and pioneered the study of human intelligence from an evolutionary standpoint. His
contributions to science earned him great recognition, including the Royal Medal
awarded by the Royal Society of London.

Galton died in Surrey, United Kingdom, on 17 January 1911.


most likely to get: four heads or two heads and two tails? Let’s first observe that
when throwing a coin four times there are 16 possible results, which we will group
according to the number of heads (H) and tails (T):


HHHH (results with four heads)
HHHT, HHTH, HTHH, THHH (results with three heads)
HHTT, HTHT, HTTH, THHT, THTH, TTHH (results with two heads)
HTTT, THTT, TTHT, TTTH (results with one head)
TTTT (results with zero heads)


Whilst there are six results in which two heads and two tails are achieved,
there is only one with four heads. This means that it is six times more likely to
get two heads and two tails than to get four heads. Specifically, the probability of
four heads is $\frac{1}{16} = 0.0625$, whereas for two heads and two tails, the probability
is $\frac{6}{16} = 0.375$. The figure shows a graphical representation of the different
amounts of results in which the central column (representing the number of
results where there are two heads and two tails) is six times higher than the
columns at the ends.




As the height of the columns is proportional to the number of results, 
the central column is six times higher than those at the ends. 
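
The grouping of the 16 results can be generated mechanically. The sketch below (variable names are illustrative) lists every sequence of four tosses, counts the heads in each, and prints how many results and what probability correspond to each number of heads.

```python
from collections import Counter
from itertools import product

results = ["".join(tosses) for tosses in product("HT", repeat=4)]
by_heads = Counter(result.count("H") for result in results)

print(len(results))   # 16
for heads in range(4, -1, -1):
    count = by_heads[heads]
    print(heads, "heads:", count, "results, probability", count / len(results))
```

The counts come out as 1, 4, 6, 4 and 1, which are the column heights described in the caption above.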


Now let’s go back to Galton’s box and imagine that every time a ball hits an
obstacle, the ball, metaphorically speaking, tosses a coin and jumps to the left
if it comes down heads and to the right if it comes down tails. This tells us that if
there were four obstacles, the balls would be distributed in the receptacles at the
bottom of the box as shown in the figure above and most of the balls would end up
in the central receptacle (because a ball is more likely to jump twice to the left and
twice to the right than any other combination of jumps), while the quantities
would decrease symmetrically towards the ends of the box. The same would
happen if the number of obstacles were greater: the central receptacle would
receive the largest number of balls and the number would decrease symmetrically
towards the ends.
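
The coin-tossing picture can be turned into a small simulation. The sketch below is only an illustration under the assumption that every jump is an independent fair coin toss. It drops 3,000 imaginary balls through four obstacles and prints the count per receptacle next to the counts expected from the 1, 4, 6, 4, 1 pattern. The function names and the fixed random seed are choices made here, not part of the text.

```python
import random
from math import comb

def drop_balls(n_balls, n_obstacles, seed=0):
    """Simulate a bean machine; returns the ball count per receptacle, left to right."""
    rng = random.Random(seed)            # fixed seed so the run is repeatable
    counts = [0] * (n_obstacles + 1)
    for _ in range(n_balls):
        jumps_right = sum(rng.random() < 0.5 for _ in range(n_obstacles))
        counts[jumps_right] += 1
    return counts

def expected_share(n_obstacles):
    """Exact probability of each receptacle: C(n, k) / 2^n."""
    return [comb(n_obstacles, k) / 2**n_obstacles for k in range(n_obstacles + 1)]

counts = drop_balls(n_balls=3000, n_obstacles=4)
print(counts)
print([round(3000 * p) for p in expected_share(4)])   # [188, 750, 1125, 750, 188]
```

Calling `drop_balls(3000, 20)` shows the same shape for the 20-obstacle machine described earlier.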

In fact, it can be shown that if both the number of balls and the number of
obstacles are large (the larger the better), the columns of balls at the bottom of the
box will draw a curve similar to that shown in the first graph below, a curve that,
depending on the number of balls and the number of obstacles, can be flattened
out, as shown in the graph below.


Figure a) is the ‘mother’ curve; all others are deformations of it. 


Thus, under such conditions, the arrival of the balls into a specific receptacle
can be treated mathematically as a Poisson process, similar to the case involving
the cars and the toll booth. And in this situation, the probability that a ball ends
up in a specific receptacle can be calculated using a formula that includes the
number e. It is not surprising, then, that the number e is involved in the
mathematical expression which describes the curves of the previous figure. In fact, the
‘mother’ curve is mathematically described by the following expression involving
not just e but also π:


$y = \frac{1}{\sqrt{2\pi}} e^{-\frac{x^2}{2}}$

This means that if point P, with coordinates (x, y), is a point on the curve of
figure a) (see the previous page), then the value of y can be calculated by substituting
x into the above formula.

In Galton’s box, the number of balls that will eventually be in each receptacle 
is the result of many successive random events, generated by collisions of the 
balls against the obstacles. It follows that certain numbers will arise randomly 
as a result of a series of random events, independent of each other. Under these 
assumptions, it can be shown that these random numbers will be distributed 
following a curve similar to that shown in figure a) on the previous page. 
That is, there will be a ‘central’ quantity whose probability will be greater and, 
as we move away from it, the probabilities will decrease symmetrically. The 
quantities distributed according to this pattern are said to be normally distributed
or to follow a normal distribution.
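
How closely the columns of a bean machine follow the bell curve can be checked numerically. The sketch below assumes the usual form of the curve, with height e^(-z²/2) / (s·√(2π)) where z is the distance from the centre measured in units of the spread s. For a machine with n obstacles it takes the centre n/2 and the spread √n/2, which are standard facts about fair coin tosses rather than statements from the text. It then compares the exact probability of each receptacle with the height of the curve.

```python
from math import comb, exp, pi, sqrt

def normal_curve(x, mean=0.0, sd=1.0):
    """Height of the bell curve with the given centre and spread at x."""
    z = (x - mean) / sd
    return exp(-z * z / 2) / (sd * sqrt(2 * pi))

n = 20                          # obstacles, as in the text's example
mean, sd = n / 2, sqrt(n) / 2   # assumed centre and spread for fair jumps
for k in range(0, n + 1, 2):
    exact = comb(n, k) / 2**n   # probability of k jumps to the right
    print(k, round(exact, 4), round(normal_curve(k, mean, sd), 4))
```

With 20 obstacles the two columns of numbers already agree to within a few thousandths.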

A very important application of the normal distribution occurs in the theory
of measurement errors. When, for example, astronomers try to establish the
location of a star, the recorded position is never exact, as measurement errors
inevitably arise due to, among other factors, small defects in the telescope’s lens,
imperfections in the eyes of the observer or light deviations caused by atmospheric
effects. As they are the result of several random events, these errors will
follow a normal distribution. This, in turn, will allow us to know in an approximate
way the magnitude of the errors made and to give a good approximation of
the real position of the star.

This technique for determining errors in astronomical measurements was used
with great success by Carl Friedrich Gauss in the early years of the 19th century,
and that is why any curve like that in the previous figure is often called a Gaussian
bell curve. This is another small historical injustice, as the normal distribution had
already been discovered long before Gauss’ era. Several mathematicians observed it in
studies carried out throughout the 18th century, the first of which was work
published in 1733 by Abraham de Moivre.

As we have seen, probabilities and statistics are present in many facets of our
daily life and the Gaussian bell curve is involved in many of these. These include the
weights of canned goods bought in a supermarket, which do not always coincide
CARL FRIEDRICH GAUSS, ONE OF THE GREATS


Johann Carl Friedrich Gauss was born in the Duchy of Brunswick, in present-day
Germany, on 30 April 1777. If, in life, he was recognised as the best mathematician
of the era, today he is considered, along with Archimedes (287-212 BC) and Newton,
one of the three greatest mathematicians in history.

It is impossible to sum up Gauss’ enormous work in a few lines, so we can
only touch upon the precocious talent of a young man who, aged 18, solved
a problem that had remained unsolved for centuries. Can regular polygons of 7,
9, 11, 13, 14, 17, 18 and 19 sides be constructed with a ruler and a compass?
Although in classical antiquity, methods for constructing the regular polygons of 3, 4, 5, 6, 8,
10, 12, 15, 16 and 20 sides were already known, it was not known whether it was possible
to do so with the regular polygons mentioned above.

The question was settled by Gauss in 1796 when he demonstrated that, of all these
polygons, only the one with 17 sides can be constructed with a ruler and compass, and
explained how to do it, providing the first new construction of this type known in more
than 2,000 years. Gauss was always so proud of this achievement that, despite the countless
triumphs he achieved in his life, he asked for a regular 17-sided polygon inscribed in a circle
to be engraved on his tombstone.

Gauss died in Göttingen, Germany, on 23 February 1855.


with those indicated on the packaging, the useful lifetime of electric lightbulbs or 
the lengths of screws produced by the same machine. 

However, care must be taken to avoid falling into ‘abuse’ of the normal distribution.
The results of intelligence tests, for example, are usually reported according
to a Gaussian bell curve with the value 100 in the centre. But this refers only to the
results of the tests and not necessarily to intelligence itself. There is no reason to
suppose that human intelligence is distributed according to a Gaussian bell curve,
or to believe that the complexity and richness of human intelligence can simply be
measured by a number.
Chapter 4


An irrational, transcendental 
and perhaps normal number 


We have already learned various facts about the number e and the extraordinary
ability of mathematics to quantitatively describe natural or economic phenomena.
In fact, the idea that the Universe is governed by laws that can be expressed
mathematically long predates the discovery of the number e itself. At least in the
West, the first to be aware of this were the Pythagoreans, disciples of the
philosopher and mathematician Pythagoras of Samos, who lived in the 6th century BC.

The Pythagoreans attributed so much importance to mathematics that the main
motto of the school was ‘all is number’, referring exclusively to positive
integers like 1, 2, 3, 4, 5, etc. That is, the Pythagoreans believed that
all the magnitudes and quantitative relations in the Universe could be described by
whole numbers.

They believed that it was always possible to choose a particular unit of measure 
to express the length of any pair of segments, A and B, in the form of integers. For 
example, if A and B measure 2.4 and 3.04 metres, respectively, their measurements 
are obviously not integers. But if we express them in centimetres, the lengths of A 
and B become 240 and 304 centimetres, which are now both whole quantities. The 
Pythagoreans argued that this conversion can be done with any pair of segments. 


But is this really so, or are there segments that contradict this idea? 


The immeasurable, or the impossibility to measure 
two segments at the same time 


Let us imagine two segments, A and B, which the Pythagoreans maintained can be
expressed on the basis of a unit u. That is, on the basis of a segment fixed as a unit
of measure so that the length of A can be expressed as a positive integer n and that
of B as a positive integer m. Let us imagine, as the following figure shows, that A
measures 3 u and that B measures 7 u.
length(A) = 3 u


length(B) = 7 u


Segment A measures three times the unit and B, seven times. 


From this figure, it follows that $\text{length}(A) = \frac{3}{7}\,\text{length}(B)$, where length(A) is the
length of segment A and length(B) that of B; consequently,

$\frac{\text{length}(A)}{\text{length}(B)} = \frac{3\ \text{units}}{7\ \text{units}} = \frac{3}{7}$


Therefore, if A measures n units and B measures m units, we can extrapolate that:


$\frac{\text{length}(A)}{\text{length}(B)} = \frac{n\ \text{units}}{m\ \text{units}} = \frac{n}{m}$


This means that for that unit u to exist, the quotient of the lengths of A
and B must be able to be written as a fraction. In the example shown above, it
is true that


$\frac{\text{length}(A)}{\text{length}(B)} = \frac{2.4\ \text{m}}{3.04\ \text{m}} = \frac{15}{19}$
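
The reduction to 15/19, and the common unit behind it, can be checked with a few lines of Python. This is only an illustrative sketch: it works in centimetres, as the earlier example does, and uses the greatest common divisor to find the largest unit that fits a whole number of times into both segments.

```python
from fractions import Fraction
from math import gcd

a_cm, b_cm = 240, 304             # lengths of A and B in centimetres
print(Fraction(a_cm, b_cm))       # 15/19

unit_cm = gcd(a_cm, b_cm)         # largest unit measuring both segments exactly
print(unit_cm)                    # 16
print(a_cm // unit_cm, b_cm // unit_cm)   # 15 19
```

With a unit of 16 centimetres, A measures 15 units and B measures 19 units.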


But, to their great surprise, the Pythagoreans discovered that this condition is not
always fulfilled. They discovered that the quotient of the lengths of certain segments
cannot be expressed as a fraction, so that for these, there is no unit u that allows us
to express both lengths as integers. Let’s take an example where B is the side of any
square and A is its diagonal:




On the left, a square of side B and diagonal A. 
On the right, a right-angled triangle where A is the hypotenuse and B the leg. 
In this figure, we have greyed out one of the two halves of the square, which
forms a right-angled triangle in which the hypotenuse is segment A and each leg
is equal to B. According to the Pythagorean theorem, in a right-angled triangle, the
square of the hypotenuse is equal to the sum of the squares of the legs. Therefore,
the following statement is valid:


$\text{length}(A)^2 = \text{length}(B)^2 + \text{length}(B)^2$


$\text{length}(A) = \sqrt{\text{length}(B)^2 + \text{length}(B)^2} = \sqrt{2\,\text{length}(B)^2} = \sqrt{2}\,\text{length}(B)$


As a consequence:


$\frac{\text{length}(A)}{\text{length}(B)} = \frac{\sqrt{2}\,\text{length}(B)}{\text{length}(B)} = \sqrt{2}$


The question is: can you express $\sqrt{2}$ as a fraction? As the Pythagoreans
showed, the answer is no.

To prove this, we will use a demonstration technique widely used in mathematics
called demonstration by the absurd or reduction to absurdity (also known as proof by
contradiction). According to this method, if we want to prove a statement, we proceed
to verify that it cannot be false, because if it were, it would produce a contradiction.
In order to do this, first, the statement we want to prove is denied, and then successive
logical consequences are obtained from that negation until a logical impossibility is
reached, that is to say, a contradiction or absurdity, which proves that it is not possible
for the statement to be false and, therefore, it must be true. The following figure shows
the general schematic for a demonstration by the absurd.


We assume that the P statement is false
→ From that assumption a contradiction (an absurdity) is deduced
→ The P statement cannot be false


General schematic of a demonstration by the absurd, or reduction to the absurd. 




How does this method apply to the demonstration that $\sqrt{2}$ cannot be written
as a fraction? What we want to show is that there are no positive integers n and m
such that $\sqrt{2}$ is equal to $\frac{n}{m}$. Following this method of demonstration
by the absurd, we begin by assuming that there are two integers n and m,
such that:


$\sqrt{2} = \frac{n}{m}$


From this equation, we will try to deduce a contradiction. Let’s begin by observing
that any fraction can be written with a numerator and a denominator that
do not have common divisors. For example, the fraction $\frac{42}{98}$ can also be written as
$\frac{3}{7}$, without 3 and 7 having common divisors. So, we can suppose that the numbers
n and m have been chosen precisely so that they have no common divisors.


Then, we square both sides of the equation:

$$2 = \frac{n^2}{m^2}$$

Therefore, $2m^2 = n^2$.

As $n^2$ is double $m^2$, we deduce that $n^2$ is even and therefore n is also even
(the square of an odd number is always odd). For example, $36 = 6^2$ is even, as is 6.
So, because it is even, n is the double of some integer k, that is, $n = 2k$:

$$n^2 = (2k)^2$$

$$n^2 = 4k^2$$

And given that $2m^2 = n^2$, we can conclude that:

$$2m^2 = 4k^2$$

$$m^2 = 2k^2$$


As $m^2$ is the double of $k^2$, then $m^2$ is even, so m is also even. So, we deduce
that if $\sqrt{2}$ could be written as a fraction $\frac{n}{m}$, then both numbers
would necessarily be even and therefore divisible by 2. But this is impossible
because, as we said before, n and m were chosen so that they have no common
divisors. Here, we have found a contradiction, because the fraction $\frac{n}{m}$ has
a property it cannot have: its two components are even. As the contradiction arises
from supposing that the fraction $\frac{n}{m}$ exists, we conclude that, in reality,
this fraction cannot exist, as shown in the following diagram:


[Figure: n and m do not have common divisors ↔ n and m are divisible by 2
(contradiction); therefore $\sqrt{2}$ cannot be written as a fraction.]

Schematic of the proof that $\sqrt{2}$ is irrational.


Therefore, we have seen that $\sqrt{2}$ cannot be written as a fraction and therefore
it is impossible to find a unit u according to which the lengths of the side and the
diagonal of the square are both expressed in whole numbers. The side and the
diagonal of the square are, according to the name given to them by the Pythagoreans
themselves, incommensurable segments, a term that literally means that ‘they cannot
be measured at the same time’.
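The proof above settles the matter for every possible fraction. As a purely illustrative companion, the following Python sketch searches a finite range of denominators for a pair (n, m) with n² = 2m². The function name and the search limit are assumptions made for this example, and an empty result is only consistent with the proof, it does not replace it.

```python
from math import isqrt


def fractions_equal_to_sqrt2(max_denominator):
    """Return every pair (n, m), 1 <= m <= max_denominator, with n*n == 2*m*m."""
    hits = []
    for m in range(1, max_denominator + 1):
        n = isqrt(2 * m * m)        # largest integer whose square is <= 2*m^2
        if n * n == 2 * m * m:      # exact integer comparison, no rounding
            hits.append((n, m))
    return hits


# The proof guarantees that no pair can ever be found.
print(fractions_equal_to_sqrt2(100_000))
```

Working with integers only (via `isqrt`) avoids the rounding problems that a floating-point comparison with `2 ** 0.5` would introduce.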

Legend has it that when the existence of incommensurables was proved, refuting
their motto that ‘all is number’ (whole number), this caused such a serious conflict
among the Pythagoreans that members of the school tried to keep the disturbing
discovery secret. But it seems that Hippasus of Metapontum, who according to some
accounts was the discoverer of incommensurables, opposed this ‘pact of silence’ and
made the discovery public. Some accounts claim that he paid for this ‘betrayal’ with
his own life.

Whether or not Hippasus was responsible, the fact is that the existence of
incommensurables did not remain hidden for long and, among other consequences,
caused the real numbers to be classified into two groups: on the one hand, rational
numbers, which are those that can be written as a fraction of integers, and on the
other hand, irrational numbers, those that cannot be expressed in this way. For
example, every fraction of two integers is rational, and so are the integers
themselves, because they can be written as a fraction, for example,
$7 = \frac{7}{1}$. Likewise, the number 0.25 is rational, because
$0.25 = \frac{1}{4}$, and so is the number 0.1545454545..., because
$0.1545454545... = \frac{17}{110}$. In contrast, we have already seen that
$\sqrt{2}$ cannot be written as a fraction, so it is irrational.

An important feature of rational numbers is that their decimal form has either
a finite number of digits (as is the case for 0.25), or the expression is infinite but
periodic, i.e. there is a block of figures that is repeated again and again. This is the
case, for example, with $0.1545454545... = \frac{17}{110}$, which we can write as
$0.1\overline{54}$, or 0.12456456456..., which is equivalent to the fraction
$\frac{3{,}111}{24{,}975}$ and which we can write as $0.12\overline{456}$. As can be
seen, whether written in a finite or periodic form, the decimal form of a rational
number can be reduced to a finite number of symbols (0.25; $0.12\overline{456}$;
etc.). This is impossible for irrational numbers, as their decimal expression is
always infinite and not periodic. As a sample, we set out below the first 200 decimal
places of $\sqrt{2}$:

$\sqrt{2}$ = 1.4142135623730950488016887242096980785696718753769480731766797
3799073247846210703885038753432764157273501384623091229702492483605
5850737212644121497099935831413222665927505592755799950501152782060
57147...

Of course, given that $\sqrt{2}$ is irrational, there is no periodicity in its decimal
digits. In fact, no periodicity will ever appear, no matter how many decimal places
are calculated.
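The contrast between periodic and non-periodic decimals can be explored with a short Python sketch. It performs long division and watches for a repeated remainder, which is what forces the digits of a fraction to repeat. The function names are assumptions chosen for this illustration, and the fraction is assumed to lie between 0 and 1.

```python
from math import isqrt


def decimal_expansion(numerator, denominator):
    """Long division of a fraction between 0 and 1.

    Returns (non_repeating_digits, repeating_block) as strings; the
    repeating block is empty when the expansion terminates.
    """
    digits = []
    seen = {}                                  # remainder -> index of first appearance
    remainder = numerator % denominator
    while remainder != 0 and remainder not in seen:
        seen[remainder] = len(digits)
        remainder *= 10
        digits.append(str(remainder // denominator))
        remainder %= denominator
    if remainder == 0:
        return "".join(digits), ""
    start = seen[remainder]                    # the period begins where the remainder recurs
    return "".join(digits[:start]), "".join(digits[start:])


def sqrt2_digits(places):
    """Decimal expansion of the square root of 2, truncated, using integers only."""
    whole = str(isqrt(2 * 10 ** (2 * places)))
    return whole[0] + "." + whole[1:]


print(decimal_expansion(1, 4))          # ('25', '')
print(decimal_expansion(17, 110))       # ('1', '54')
print(decimal_expansion(3111, 24975))   # ('12', '456')
print(sqrt2_digits(20))                 # 1.41421356237309504880
```

Since a division by m can leave only m different remainders, a remainder must eventually repeat (or become 0), which is why every fraction ends up finite or periodic.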

This ‘intrinsic infinity’ of irrational numbers even led some mathematicians to
doubt that they were really numbers. For example, in 1544, in his work Arithmetica
Integra, the German mathematician Michael Stifel (1487-1567) wrote:


On the other hand, other considerations compel us to deny that irrational numbers
are numbers at all, because when we try to subdue them we find that they are
perpetually evasive, so none of them can be apprehended with precision. [...] It
cannot be called a true number, because it is of such a nature that it lacks
precision. [...] Therefore, just as an infinite number is not a number, an irrational
number is not a true number, as it covers an infinite number.


PYTHAGORAS OF SAMOS AND THE PYTHAGOREAN SCHOOL

There is very little known about the life of Pythagoras, as the few older written
references preserved are those from the work of the Byzantine mathematician
Proclus (411-485), some thousand years later.

It is known that the Greek mathematician was born in the second half of the 6th
century BC and died in the first half of the 5th century BC. As a young man, he
travelled through Egypt, Mesopotamia and probably also India, and during these
trips he studied philosophy and geometry. It was also then that he came across the
theorem that today bears his name, but which had already been known by the
Egyptians and Mesopotamians for several centuries. It is said that Pythagoras was
the first to give a proof of the famous theorem, but the truth is that there is no
record of exactly how he did it.

After his return, Pythagoras founded a school on the Greek island of Samos, a
community whose members were dedicated to the study of nature, philosophy and
mathematics. Pupils of the Pythagorean school followed very strict rules of life.
Among other things, they were vegetarian because they believed that when they
died, their soul could be reincarnated in an animal. Perhaps its three most important
precepts were the following:

1) At the deepest level, the nature of reality is mathematical. In other words, all
things are numbers.

2) Philosophy contributes to spiritual purification.

3) The members of the school owe one another complete loyalty.

After his death, and mainly due to political reasons, the Pythagorean school, with
headquarters in several cities, was bitterly persecuted. But, in spite of this, its
influence on Greek thought lasted for centuries.


A GOLDEN NUMBER HIDDEN IN THE PENTALPHA

It is not known with certainty what the first pair of incommensurable segments
discovered by the Pythagoreans was. It is assumed that it was formed by the side and
the diagonal of a square, which seems reasonable as the incommensurability of those
segments is deduced from Pythagoras’ theorem. But there is another pair of segments
that also competes for that position.

It is a pair of segments hidden precisely in the symbol of the Pythagorean school,
the pentalpha, a figure obtained by tracing all the diagonals of a regular pentagon.
As its name implies, its five points form five capital alphas, a letter that is drawn as
a triangle similar to our letter A.

Presumably, the Pythagoreans studied the geometrical properties of this figure. The
fact is that the diagonal and the side of the regular pentagon are also
incommensurable, as the quotient of their lengths is the irrational number
$\frac{1+\sqrt{5}}{2}$, known as the golden ratio or the golden number, a candidate
for being the first irrational number ever discovered. The golden ratio gives
rectangles whose sides are in the proportion 1 to $\frac{1+\sqrt{5}}{2}$, which is
considered aesthetically pleasing. This is why so-called aureus or golden rectangles
appear in many sculptures, paintings and buildings of all ages.


Finding out whether or not a real number is irrational has always been a matter
of utmost importance. However, here it is the following question that particularly
interests us: is the number e irrational?

The first person to answer this question was Leonhard Euler in 1748, in his
Introductio in analysin infinitorum, the work in which he had already shown how the
number e could be written in series form. But as Euler’s answer is based on the
writing of e as a continued fraction, we shall discuss this first.


Continued fractions or how to find out if e is irrational


We will now take, as an example, the number $1+\sqrt{2}$, which, like $\sqrt{2}$, is
irrational and cannot be written as a fraction. However, we may try to write
$1+\sqrt{2}$ through a kind of ‘infinite fraction’, for which we shall begin by noting
that:

$$(1+\sqrt{2})^2 = 1 + 2\sqrt{2} + (\sqrt{2})^2 = 2 + 2\sqrt{2} + 1 = 2(1+\sqrt{2}) + 1$$

In other words, if we call $\alpha = 1+\sqrt{2}$, then it follows that
$\alpha^2 = 2\alpha + 1$. And if we divide both sides of this last equation by
$\alpha$, we obtain:

$$\alpha = 2 + \frac{1}{\alpha}$$

Given that $\alpha = 2 + \frac{1}{\alpha}$, we can replace the $\alpha$ on the
right-hand side of the equation by $2 + \frac{1}{\alpha}$:

$$\alpha = 2 + \cfrac{1}{2 + \cfrac{1}{\alpha}}$$

If we continue in the same way to infinity, we will arrive at:

$$1+\sqrt{2} = 2 + \cfrac{1}{2 + \cfrac{1}{2 + \cfrac{1}{2 + \cdots}}}$$


The expression appearing on the right-hand side of this last equation is called a
continued fraction. Generalising this example, it is said that a number
$\beta \neq 0$ is written as a continued fraction when it is expressed in this way:

$$\beta = a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \cfrac{1}{a_3 + \cdots}}}$$

In this continued fraction, $a_0$ is an integer (having the same sign as $\beta$;
that is, $a_0$ is positive if $\beta$ is, and negative if $\beta$ is), and
$a_1, a_2, a_3, \ldots$ are all positive integers. From this expression of
$1+\sqrt{2}$ as a continued fraction, we can deduce that:

$$\sqrt{2} = 1 + \cfrac{1}{2 + \cfrac{1}{2 + \cfrac{1}{2 + \cdots}}}$$

As it is very cumbersome to write fractions containing fractions, which in turn
contain fractions, the expression is usually abbreviated as
$1+\sqrt{2} = [2; 2, 2, 2, \ldots]$ and $\sqrt{2} = [1; 2, 2, 2, \ldots]$.
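The idea of substituting the expression 2 + 1/α into itself again and again can be tried numerically. The sketch below is only an illustration under assumed names (`iterate_alpha`, the starting value 2.0 and the number of steps are arbitrary choices): each pass adds one more level to the continued fraction, and the values settle towards 1 + √2.

```python
def iterate_alpha(steps, start=2.0):
    """Apply alpha -> 2 + 1/alpha `steps` times, starting from `start`."""
    alpha = start
    for _ in range(steps):
        alpha = 2 + 1 / alpha    # one more level of the continued fraction
    return alpha


for steps in (1, 2, 5, 30):
    print(steps, iterate_alpha(steps))

print("1 + sqrt(2) =", 1 + 2 ** 0.5)
```

Subtracting 1 from the result gives the corresponding approximation of √2 = [1; 2, 2, 2, ...].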

Let’s see another example of writing a continued fraction, in this case that of the
rational number $\frac{25}{7}$. For this, first of all we must remember the validity
of the following equation: $\frac{n}{m} = \cfrac{1}{\frac{m}{n}}$. Thus:

$$\frac{25}{7} = 3 + \frac{4}{7} = 3 + \cfrac{1}{\frac{7}{4}} = 3 + \cfrac{1}{1 + \frac{3}{4}} = 3 + \cfrac{1}{1 + \cfrac{1}{\frac{4}{3}}} = 3 + \cfrac{1}{1 + \cfrac{1}{1 + \frac{1}{3}}}$$

Consequently:

$$\frac{25}{7} = 3 + \cfrac{1}{1 + \cfrac{1}{1 + \cfrac{1}{3}}}$$

That is, $\frac{25}{7} = [3; 1, 1, 3]$. Let us return briefly to the writing of
$\frac{25}{7}$.


We have observed that the development of a real number as a continued fraction
can be finite or infinite: it can ‘stop after a finite number of steps’ or ‘continue
infinitely’. This is an extremely important difference for the subject at hand: when
the continued fraction stops after a finite number of steps, the number is rational,
whereas if the number is irrational, the writing continues infinitely.

It has been found above that the writing of $\sqrt{2}$ as a continued fraction never
stops, which shows us that $\sqrt{2}$ is irrational.

Therefore, to know whether e is rational or irrational, we will express it as a
continued fraction and, as Euler did in 1748, find out whether it is finite or
infinite. But how are the coefficients of the continued fraction of any number
$\beta$ calculated? To answer this question, we will call $\lfloor x \rfloor$ the
largest integer that is not greater than x; for example, $\lfloor 7.96 \rfloor = 7$,
$\lfloor \sqrt{2} \rfloor = 1$ and $\lfloor e \rfloor = 2$; if x is an integer, then
$\lfloor x \rfloor$ is defined as the number x itself, for example,
$\lfloor 5 \rfloor = 5$.

According to this notation, the coefficients of the writing of a number
$\beta \neq 0$ as a continued fraction are calculated successively in this way:

$$a_0 = \lfloor \beta \rfloor$$

$$a_1 = \left\lfloor \frac{1}{\beta - a_0} \right\rfloor$$

$$a_2 = \left\lfloor \cfrac{1}{\cfrac{1}{\beta - a_0} - a_1} \right\rfloor$$

And so on. If, in the calculation of any of the coefficients, a division by 0 occurs
(an operation that in mathematics is ‘forbidden’), then the calculation stops, and we
establish that the number is rational. If, on the contrary, this never happens, the
calculation continues to infinity, which indicates that the number is irrational.


DIVIDING BY 0 LEADS NOWHERE

Let us write the coefficients of the continued fraction of $\frac{25}{7}$ using the
given formulas:

$$a_0 = \left\lfloor \frac{25}{7} \right\rfloor = \lfloor 3.5714... \rfloor = 3 \qquad a_1 = \left\lfloor \cfrac{1}{\frac{25}{7} - 3} \right\rfloor = \lfloor 1.75 \rfloor = 1$$

$$a_2 = \left\lfloor \cfrac{1}{\frac{7}{4} - 1} \right\rfloor = \lfloor 1.333... \rfloor = 1 \qquad a_3 = \left\lfloor \cfrac{1}{\frac{4}{3} - 1} \right\rfloor = \lfloor 3 \rfloor = 3$$

If we tried to calculate the coefficient $a_4$, the corresponding formula would force
us to solve the ‘forbidden’ division $\frac{1}{0}$. Therefore, the calculation stops at
the coefficient $a_3$ and the writing of $\frac{25}{7}$ as a continued fraction is
[3; 1, 1, 3].

But what does it mean that the division $\frac{1}{0}$ is ‘forbidden’? It means that
there is no coherent way of assigning it a result. For example, why is
$\frac{15}{2} = 7.5$? The answer is ‘because 2 · 7.5 = 15’. If $\frac{1}{0}$ resulted in
a real number x, that number would have to satisfy 1 = 0 · x; but 0 · x is always 0, it
can never be 1. The equation 1 = 0 · x cannot be satisfied by any real number x or, in
other words, $\frac{1}{0}$ has no possible result.
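The coefficient formulas, including the rule of stopping when a division by 0 would occur, can be sketched in Python as follows. The function name and the `max_terms` cut-off are assumptions made for this illustration. Exact fractions are used for rational inputs; for an irrational number only a floating-point approximation is available, so only the first few coefficients can be trusted.

```python
import math
from fractions import Fraction


def continued_fraction(beta, max_terms=20):
    """Return the coefficients [a0, a1, a2, ...] of beta, at most max_terms of them."""
    coefficients = []
    for _ in range(max_terms):
        a = math.floor(beta)             # integer part: the next coefficient
        coefficients.append(a)
        fractional_part = beta - a
        if fractional_part == 0:         # the next step would be 1/0: stop here
            break
        beta = 1 / fractional_part
    return coefficients


print(continued_fraction(Fraction(25, 7)))     # [3, 1, 1, 3]
print(continued_fraction(math.sqrt(2), 6))     # [1, 2, 2, 2, 2, 2]
print(continued_fraction(math.e, 6))           # [2, 1, 2, 1, 1, 4]
```

With `Fraction` inputs the arithmetic is exact, so the loop stops precisely when the number is rational. With floating-point inputs rounding errors grow at every step, which is why the example asks for only six coefficients.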


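In the specific case of the number e, the first three coefficients are:

$$a_0 = \lfloor e \rfloor = 2 \qquad a_1 = \left\lfloor \frac{1}{e - 2} \right\rfloor = \lfloor 1.3922... \rfloor = 1 \qquad a_2 = \left\lfloor \cfrac{1}{\frac{1}{e - 2} - 1} \right\rfloor = \lfloor 2.5496... \rfloor = 2$$

If we continue in this way, we discover that the writing of e as a continued
fraction has a very interesting regularity:

$$e = [2; 1, 2, 1, 1, 4, 1, 1, 6, 1, 1, 8, 1, 1, 10, 1, 1, 12, 1, 1, 14, 1, \ldots]$$

After the initial 2, the sequence ‘1, even number, 1’ always follows, and we see that
the sequence continues infinitely and never ‘stops’. This allows us to respond,
at last, to the question formulated above. Is the number e irrational? Yes, it is.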
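The regularity ‘1, even number, 1’ makes it easy to generate as many coefficients as desired and to check numerically that the resulting fractions approach e. The following Python sketch is an illustration only; the function names are assumptions for this example, and the pattern itself is taken as given rather than derived.

```python
import math
from fractions import Fraction


def e_coefficients(count):
    """First `count` coefficients of e = [2; 1, 2, 1, 1, 4, 1, 1, 6, ...]."""
    coefficients = [2]
    k = 1
    while len(coefficients) < count:
        coefficients.extend([1, 2 * k, 1])   # the block '1, even number, 1'
        k += 1
    return coefficients[:count]


def evaluate(coefficients):
    """Collapse a finite continued fraction, from the innermost term outwards."""
    value = Fraction(coefficients[-1])
    for a in reversed(coefficients[:-1]):
        value = a + 1 / value
    return value


print(e_coefficients(10))                    # [2, 1, 2, 1, 1, 4, 1, 1, 6, 1]
approximation = evaluate(e_coefficients(20))
print(approximation, float(approximation), math.e)
```

Every truncation is an ordinary fraction, so each one is rational; it is only the infinite expression that equals e.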

The irrationality of e is directly linked to the topic discussed in chapter 2,
where we discussed the successive calculations of decimal places of e. In fact, if the
number were rational, these calculations would end sooner or later, either because
they would end in a last digit or because a block of digits would be reached
and repeated again and again. But, as it is irrational, it doesn’t matter if a billion or
a trillion decimal places have been computed. The calculation of the digits of the
number e can never be terminated.


Algebraic and transcendental numbers: the mystery of squaring the circle


We know that real numbers can be divided into rational and irrational numbers,
but, of course, this is not the only possible classification. Now we will look at
another way of classifying them that dates from the 19th century, but which arose as
a result of a much older problem raised in Ancient Greece towards the 5th century
BC. This is the famous problem of squaring the circle, which we still allude to when
we face a problem that seems irresolvable. Because the problem of squaring the
circle certainly is.

But let’s try to reach that conclusion by our own means. The problem demands
constructing, from any given circle and using only a non-graduated ruler and a
compass, a square which, as shown in the following figure, has exactly the same
area as the given circle:

[Figure: squaring the circle; label: non-graduated ruler.]

Diagram of the squaring of a circle. From a given circle, you have to construct a
square of the same area with a non-graduated ruler and compass.


Although at first glance it may seem a minor problem, in terms of Greek thought,
solving the squaring of the circle was a matter of utmost importance. For the
Greeks, mathematics was reduced to geometry and, in particular, ‘to construct’ was
equivalent to ‘to calculate’. In this case, constructing a square with the same area as
a given circle was equivalent to calculating the area of that circle.

But none of the Greek mathematicians managed to resolve this. The question of
squaring the circle remained unanswered for millennia and became one of the most
famous problems in the history of mathematics.

Finally, the riddle was solved in two parts. The first part was developed in 1837
by the French mathematician Pierre Laurent Wantzel (1814-1848), and in it the
number $\pi$ had a prominent place. For the second part, which the German
mathematician Carl Louis Ferdinand von Lindemann (1852-1939) concluded in 1882,
the fundamental role fell to the number e. Let’s proceed in parts.

To explain what Wantzel found, imagine that we have a circle of radius R and
that we want to construct a square of side L that has the same area. We recall that:

Area of a circle = $\pi R^2$

Area of a square = $L^2$

As the two areas must be equal, then $L^2 = \pi R^2$. And consequently:

$$L = \sqrt{\pi}\,R$$

We can then conclude that the side of the square that we want to build must measure
$\sqrt{\pi}\,R$. In other words, given a segment of length R, the problem of squaring
the circle requires constructing a segment of length $\sqrt{\pi}\,R$, using a
non-graduated ruler and a compass.

We already know that from two segments of lengths A and B, it is always
possible to construct with these instruments a third segment whose length is
the product of the other two, i.e. a segment of length A · B, as shown in the
following figure:

[Figure, parts a) and b): construction of a segment x from segments of lengths A, B
and 1.]

In part a), on two perpendicular lines we draw segments of lengths A, B and 1.
In part b), the two lines are parallel to each other.
It can be shown that segment x measures A · B.


Therefore, the problem is reduced to whether it is possible to construct a segment
with length $\sqrt{\pi}$ (once we have it, the previous figure tells us how to
construct another of length $\sqrt{\pi}\,R$). But is it possible to construct with a
ruler and compass a segment of length $\sqrt{\pi}$? Or, posing the question in more
general terms, what exactly are the segments that can be constructed with these two
instruments? Pierre Laurent Wantzel found the answer and discovered that it was
related to the classification of real numbers into algebraic and transcendental
numbers.

A real number is algebraic when it is the solution of an equation of type:

$$a_n x^n + \cdots + a_2 x^2 + a_1 x + a_0 = 0,$$

where $a_0, a_1, a_2, \ldots, a_n$ are all integers, not all of them zero. For
example, $\sqrt{2}$ is algebraic because it is the solution of the equation
$x^2 - 2 = 0$; $\sqrt[3]{7}$ is also algebraic because it is the solution of the
equation $x^3 - 7 = 0$; and so is the number $\frac{5}{3}$, because it is the solution
of the equation $3x - 5 = 0$.

The definition can be extended to complex numbers, so, for example, the number i
is algebraic because it is the solution of the equation $x^2 + 1 = 0$. On the other
hand, a real (or complex) number is transcendental when it is not algebraic, that
is, if it is not a solution of any equation of the form shown above (we will soon
show examples of transcendental numbers).
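The examples of algebraic numbers can be checked numerically by substituting each number into its equation. The Python sketch below does this with Horner’s rule; the function name and the convention of listing coefficients from the highest power down are assumptions made for this illustration. Floating-point roots give values that are only approximately 0, whereas the fraction and the complex unit give exactly 0.

```python
from fractions import Fraction


def evaluate_polynomial(coefficients, x):
    """Evaluate a_n*x^n + ... + a_1*x + a_0, given [a_n, ..., a_1, a_0]."""
    result = 0
    for a in coefficients:
        result = result * x + a      # Horner's rule
    return result


print(evaluate_polynomial([1, 0, -2], 2 ** 0.5))         # x^2 - 2 at sqrt(2): about 0
print(evaluate_polynomial([1, 0, 0, -7], 7 ** (1 / 3)))  # x^3 - 7 at the cube root of 7: about 0
print(evaluate_polynomial([3, -5], Fraction(5, 3)))      # 3x - 5 at 5/3: exactly 0
print(evaluate_polynomial([1, 0, 1], 1j))                # x^2 + 1 at i: exactly 0
```

A check like this can confirm that a number is a solution of a given equation, but it can never show that a number is transcendental, since that would require ruling out every possible equation.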


For example...

$\sqrt[3]{7}$ is the solution of $x^3 - 7 = 0$ → The highest power of x is 3, which is
not a power of 2. → It is not possible to construct it with a ruler and compass.

$\sqrt{2}$ is the solution of $x^2 - 2 = 0$ → The highest power of x is $2 = 2^1$. →
It is possible to construct it with a ruler and compass.

A segment of length $\sqrt{2}$ can be constructed with a ruler and compass, for
example, by plotting the diagonal of a square of side 1.


Thus, in 1837, Pierre Wantzel demonstrated that a segment can be constructed
with a non-graduated ruler and a compass only when its length is an algebraic
number that is the solution of an equation, of the lowest possible degree, in which
the largest exponent of x is a power of 2 (e.g. $1 = 2^0$, or $2 = 2^1$, or
$4 = 2^2$, etc.).

It is possible, for example, to construct with a ruler and compass a segment of
length $\sqrt{2}$, and indeed that number is the solution to the equation
$x^2 - 2 = 0$, where the highest power of x that appears is 2. But it is impossible to
do so with a segment of length $\sqrt[3]{7}$ because, although it is an algebraic
number, it is the solution of the equation $x^3 - 7 = 0$, where the greatest power of
x is 3.

The question, then, is to find out if $\sqrt{\pi}$ is algebraic or transcendental
and, if it is algebraic, to establish which equation it is the solution of. This
question was answered in 1882 by Carl Louis Ferdinand von Lindemann, who showed
that:

If $\alpha \neq 0$ is an algebraic number, then $e^{\alpha}$ is a transcendental number.

It follows that $\pi$ is transcendental, which we will see, once again, by the method
of reduction to the absurd. Let’s suppose that $\pi$ is algebraic. As there is a
property that says that the product of two algebraic numbers is another algebraic
number, and i is algebraic, we deduce that $\pi i$ also is.


TWO OTHER CLASSIC AND IRRESOLVABLE PROBLEMS

In addition to the problem of squaring the circle, there are two other famous classic
problems of construction with a ruler and compass that were also raised towards the
5th century BC and resolved in the 19th century. One of them is the problem of
duplicating the cube, which involves constructing, from the edge of any cube, the
edge of a cube that has twice the volume. The other is the trisection of an angle,
which consists of finding a method for dividing any angle into three equal parts.

The aforementioned Wantzel theorem shows that these two problems are also
unsolvable. Solving either of them would require constructing a segment whose
length is an algebraic number that is the solution of an equation in which the
highest power of x that appears is 3.

Pierre Laurent Wantzel (1814-1848).




But then, according to Lindemann’s theorem, $e^{\pi i}$ would be transcendental.
However, according to Euler’s formula (see chapter 2), $e^{\pi i} = -1$, so −1 would
be a transcendental number. But this is false: −1 is algebraic because it is the
solution of the equation x + 1 = 0. In short, if $\pi$ were algebraic then −1 would be
transcendental. But as it is not, we deduce that $\pi$ is not algebraic and therefore
is transcendental.

The following figure summarises this reasoning.


[Figure: We assume that $\pi$ is algebraic. → Then $\pi i$ is algebraic. → Then
$e^{\pi i} = -1$ is transcendental. But −1 is algebraic (contradiction). → $\pi$ is
transcendental.]

This chart shows that $\pi$ is transcendental.
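The key numerical fact used in this argument, Euler’s formula $e^{\pi i} = -1$, can be observed with Python’s standard `cmath` module. This is a floating-point illustration under an assumed helper name, not a proof: the computed value differs from −1 by a tiny rounding error in the imaginary part.

```python
import cmath


def euler_identity_value():
    """Return e raised to the power pi*i, computed in floating point."""
    return cmath.exp(1j * cmath.pi)


value = euler_identity_value()
print(value)                 # very close to -1, with a tiny imaginary rounding error
print(abs(value + 1))        # distance from -1, of the order of 1e-16
```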


What happens with $\sqrt{\pi}$? Another proof by contradiction shows us that it is
also transcendental. If it were algebraic, $\sqrt{\pi} \cdot \sqrt{\pi} = \pi$ would
be algebraic too. But we know that $\pi$ is transcendental, and therefore
$\sqrt{\pi}$ is as well, which makes it impossible to construct with a ruler and
compass a segment of length $\sqrt{\pi}$. Consequently, given any circle, it is not
feasible to construct a square with the same area as the circle using these
instruments. So, indeed, squaring the circle is an impossible mission.

Lindemann’s theorem also allows us to prove that the number e is transcendental,
in the following way:

As the number 1 is algebraic (because it is a solution of the equation x − 1 = 0),
then $e^1 = e$ is transcendental.


However, the transcendence of e had already been demonstrated in 1873, nine
years before the demonstration of Lindemann’s theorem, by the Frenchman Charles




FINITE AND INFINITE CONTINUED FRACTIONS

The writing of real numbers as continued fractions tells us something more about
the number e.

We know that the continued fractions of rational numbers are finite and those of
irrational numbers are infinite. But in the case of the latter, it can also be shown (it
is a theorem of Lagrange) that the writing is periodic exactly when the irrational
number is the solution of an equation with integer coefficients in which the highest
power of x is 2. All other irrational numbers, whether algebraic like $\sqrt[3]{7}$ or
transcendental, have continued fractions that are infinite and not periodic.

For example, in the case of $\sqrt{2}$, which is the solution of $x^2 - 2 = 0$, the
writing is indeed periodic, since $\sqrt{2} = [1; 2, 2, 2, \ldots]$ and, as can be seen,
there is a block of coefficients (in this case formed only by the number 2) which is
repeated again and again. But the writing of e as a continued fraction is not only
infinite:

$$e = [2; 1, 2, 1, 1, 4, 1, 1, 6, 1, 1, 8, 1, 1, 10, 1, 1, 12, 1, 1, 14, 1, \ldots]$$

It is also non-periodic. There cannot be a block of numbers that repeats over and
over again, because the even number between the two 1s is constantly increasing
and never repeats itself. This proves that e is not the solution of any equation of
degree 2 with integer coefficients. By itself it does not prove that e is
transcendental, which requires a stronger argument.


Hermite (1822-1901) using mathematical analysis techniques beyond the scope
of this book.

After verifying that e is irrational and transcendental, we will study another
characteristic that, according to conjecture, the number e also possesses, a feature
related to the subject of the previous chapter: chance. Numbers with this
characteristic are called normal, although in reality they are not so normal.


Normal numbers: when being normal is a bit weird 


The concept of a normal number was conceived at the beginning of the 20th century
by the French mathematician Émile Borel (1871-1956), and describes a real
number whose digits follow a uniform distribution in which all digits are equally
likely. To explain this, let’s start by taking the example of a normal number in base
10. Let’s imagine that we have a ten-sided die (emulating the decimal system)
numbered from 0 to 9 and that, after throwing it at random an infinite number of
times, we write down the numbers obtained one by one. Let’s say that this long line
starts with the digits 98457701...

[Figure: A ten-sided die.]

In front of it, we place a whole number followed by a decimal point, for
example 11.98457701..., thus obtaining a real number. But what characteristics
does it have?

As the digits have been generated randomly, there is no way to predict which
digit will occupy each place before throwing the die. However, it is possible to
predict how the digits will behave overall. For example, if we take a million, billion
or trillion consecutive digits (the greater the quantity, the more accurate the
prediction), we can predict that about 10% of these digits will be 0, 10% will be 1,
10% will be 2, etc. And if, in that million or trillion digits, we focus our attention on
pairs of consecutive digits, about 1% of them will be the pair 00, 1% the pair 01, 1%
the pair 02 and so on. This is because, when a ten-sided die is thrown twice, the
probability that the pair 00 (or any other) comes out is
$\frac{1}{10} \cdot \frac{1}{10} = \frac{1}{100}$, which is equivalent to 1%.

Similarly, if in a million or trillion digits we look at triples of consecutive digits,
then roughly 0.1% of them will be the triple 000, 0.1% will be the triple 001 and
so on.
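These predictions can be tried out with a simulated die. The sketch below is an illustration under stated assumptions: Python’s pseudo-random generator stands in for the ten-sided die, the seed and the number of throws are arbitrary, and the function name is invented for this example. It counts how often each block of consecutive digits occurs.

```python
import random
from collections import Counter


def block_frequencies(digits, block_length):
    """Relative frequency of every block of `block_length` consecutive digits."""
    blocks = [digits[i:i + block_length]
              for i in range(len(digits) - block_length + 1)]
    counts = Counter(blocks)
    return {block: count / len(blocks) for block, count in counts.items()}


rng = random.Random(2024)                        # fixed seed, so the run is repeatable
throws = "".join(str(rng.randrange(10)) for _ in range(200_000))

singles = block_frequencies(throws, 1)
pairs = block_frequencies(throws, 2)
triples = block_frequencies(throws, 3)
print(singles["0"], pairs["00"], triples["000"])  # roughly 0.1, 0.01 and 0.001
```

The more throws are simulated, the closer the frequencies tend to get to 10%, 1% and 0.1%, although any finite run only approximates them.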

A surprising consequence is that any finite sequence of digits, no matter how
long, appears in the written form of our random number an infinite number of times.
If, for example, we take the digits 123456789101112131415 (21 digits in all), then
that sequence appears an infinite number of times in the writing of the number.

According to Borel’s definition, a number is normal in base 10 (the base used in the
example above) if, when it is written in decimal form, the digits after the decimal
point behave like the sequence generated by the rolls of the die: every finite
sequence of digits appears in the writing of the number infinitely many times, and
with the same frequency as any other sequence of the same length.

But the definition of a normal number can be extended to any base. A number
will be normal in base 2, for example, if when written in binary (only using the digits
0 and 1), the digits beyond the point behave, as in any other base, as if they had been
obtained randomly. In this case, as there are only two possible digits, the random
generator may simply be a coin in which heads = 0 and tails = 1, or vice versa.

Borel defined normal numbers in 1909 and also showed that there are infinitely
many normal numbers. However, he could not give a single specific example of a
normal number; his proof was indirect, based on the method of reduction to the
absurd. Borel began by supposing that normal numbers did not exist and, after
proving that this led to a contradiction, deduced that infinitely many of these
so-called normal numbers necessarily existed. Although in reality, as we have seen,
they are quite special.

Even today, more than a century after Borel’s demonstration, there are very
few numbers known to be definitely normal. Furthermore, almost all of them
have rather ‘artificial’ definitions, designed almost expressly so that the number in
question is normal. Among them is 0.2357111317192329... (a number obtained by
juxtaposing all the prime numbers, 2, 3, 5, 7, 11, ...), which the North American
mathematician Arthur Herbert Copeland (1898-1970) and the Hungarian
mathematician Paul Erdős (1913-1996), working together, demonstrated in 1945 to be
a normal number.
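The digits of this number are simple to generate, which is what makes its definition feel ‘artificial’. The following Python sketch is only an illustration: the function names are assumptions, and the primes are found by plain trial division, which is adequate for a short demonstration but slow for large quantities of digits.

```python
def primes():
    """Yield the prime numbers 2, 3, 5, 7, 11, ... by trial division."""
    found = []
    candidate = 2
    while True:
        # A number is prime if no smaller prime up to its square root divides it.
        if all(candidate % p for p in found if p * p <= candidate):
            found.append(candidate)
            yield candidate
        candidate += 1


def juxtaposed_primes(count):
    """First `count` digits after the decimal point of 0.2357111317..."""
    digits = ""
    for p in primes():
        digits += str(p)
        if len(digits) >= count:
            return digits[:count]


print("0." + juxtaposed_primes(16))   # 0.2357111317192329
```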

Regarding the number e, for the moment it has not been possible to discern
whether or not it is a normal number, although it is believed that e, $\pi$,
$\sqrt{2}$ and other well-known irrational numbers are normal. At the time of
writing these lines, this conjecture has been neither demonstrated nor refuted. The
first trillion digits of e that have been calculated (see chapter 2) behave like those
of a normal number (about 10% of them are 0s, 10% are 1s, etc.), but in reality this
does not prove anything, because beyond that trillion there are still infinitely many
digits of e, which can never be fully calculated. One can only prove that e is normal
by general reasoning; it can never be achieved by calculating the digits one by one.
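On a much smaller scale, the same kind of count can be reproduced at home. The Python sketch below computes decimal places of e by adding the terms 1/k! with whole-number arithmetic and then tallies the digits. The function name and the ten extra ‘guard’ digits used to absorb rounding are assumptions of this illustration, and, as the text stresses, the tally proves nothing about normality.

```python
from collections import Counter


def e_digits(count, guard=10):
    """First `count` decimal places of e, as a string of digits."""
    scale = 10 ** (count + guard)     # work with integers scaled by a power of 10
    total, term, k = 0, scale, 0
    while term > 0:
        total += term                 # add (approximately) scale / k!
        k += 1
        term //= k                    # next term of the series 1/0! + 1/1! + 1/2! + ...
    return str(total)[1:count + 1]    # drop the leading 2 and the guard digits


print(e_digits(20))                   # 71828182845904523536

tally = Counter(e_digits(1000))
for digit in "0123456789":
    print(digit, tally[digit])
```

With only 1,000 digits the counts fluctuate visibly around 100 per digit; the fluctuations shrink, in relative terms, as more digits are examined.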

FÉLIX ÉDOUARD JUSTIN ÉMILE BOREL, A PRECOCIOUS MATHEMATICIAN

Émile Borel was born in Saint-Affrique, in the Central Pyrenees region of France, on
7 January 1871. His whole family was linked to commerce and industry, and so he
was advised to pursue higher education at the École Polytechnique, where he would
receive training geared towards such activities. However, Borel felt a vocation for
mathematics, which was galvanised by one of his professors at the faculty, the
famous French mathematician Gaston Darboux (1842-1917). In 1889, Borel finally
chose to study at the École Normale (teacher training college), where he received
solid training in sciences.

Borel published his first two research papers in 1890 when only 19 years of age.
Although relatively minor works, they were remarkable given his young age. He
published his first mathematical article of note in 1892, and the following year was
awarded a doctorate in mathematics.

Throughout his career, Borel published essential works on subjects related to
functions of a real variable, measure theory and the calculation of probabilities, and
he made first-rate contributions to studying the consequences of Einstein’s theory of
relativity.

Émile Borel passed away in Paris on 3 February 1956.


But... what if it were? No doubt we would be faced with a paradoxical
circumstance. Suppose we encode all written symbols using pairs of digits. We
say, for example, that 00 is the digit 0, 01 is the digit 1 and so on to 09.
We then follow with the pair 10, which can code the blank space; 11 can be
the lower-case letter ‘a’, 12 the ‘b’, 13 the ‘c’, etc. Then, we continue with
capital letters, punctuation marks and even the mathematical symbols frequently
used in elementary operations. In the end, we would obtain a codification that
would allow us to translate any conceivable text into figures. For example, the
sequence of figures 13111411100200 (i.e. 13 11 14 11 10 02 00) codes the letters
c, a, d, a, a blank space and the digits 2 and 0, that is, the Spanish expression
‘cada 20’ (‘every 20’).
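The two-digit coding can be sketched in a few lines of Python. This is an illustration under stated assumptions: it covers only the part of the table the text spells out (the digits, the blank space and the lower-case letters a to z, with no accented letters), and the names `SYMBOLS`, `encode` and `decode` are invented for the example. Decoding the sequence 13 11 14 11 10 02 00 letter by letter gives ‘cada 20’, the Spanish for ‘every 20’.

```python
import string

# Position in this string = two-digit code: "0" is 00, ..., "9" is 09,
# the blank space is 10, "a" is 11, "b" is 12, and so on up to "z" = 36.
SYMBOLS = string.digits + " " + string.ascii_lowercase


def encode(text):
    """Translate a text into a string of two-digit codes."""
    return "".join(f"{SYMBOLS.index(symbol):02d}" for symbol in text)


def decode(code):
    """Translate a string of two-digit codes back into text."""
    return "".join(SYMBOLS[int(code[i:i + 2])] for i in range(0, len(code), 2))


print(decode("13111411100200"))   # cada 20
print(encode("every 20"))         # 1532152835100200
```

Because every symbol takes exactly two digits, any text becomes one finite block of digits, and that is the block one would look for in the decimal expansion.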

It is clear that, using this coding, any conceivable text can be transformed into
a finite sequence of digits and, if e is normal, that sequence of digits will
necessarily appear an infinite number of times in its decimal expansion. This means
that, if e turns out to be normal, any text that has been written or may be written
in the future would appear encoded in its decimal expansion, whether the complete works of Gabriel


Garcia Marquez, the lost works of Aeschylus or all the novels that will be written 
throughout the 21st century... Can you imagine the Borges library fully encoded
in a simple number? 

Under this assumption, all texts would be contained in the digits of e, including
the full text of this book. Taken to the point of absurdity, e would contain the
text of its own definition and also the explanation of why it contains it.
Chapter 5


The number e and calculus 


Mathematical analysis, also called differential calculus or simply calculus, was developed 
at the end of the 17th century simultaneously and independently by Isaac Newton 
and Gottfried Wilhelm Leibniz. However, their work would not have been possible 
without the contributions of a long list of precursors that goes back to Archimedes, 
who lived in the 3rd century BC. 

Although in the time of Newton and Leibniz the applications of calculus were
largely restricted to the study of physical and astronomical phenomena, with time
its uses diversified in such a way that, today, it is perhaps the most used branch of
mathematics across engineering and almost all sciences.

In this chapter, we shall discuss some of the fundamental concepts of calculus
and the prominent role played, once again, by the number e in this field. We
will begin by reviewing new aspects of functions, the objects that this branch of
mathematics sets out to study.


Functions in more depth 


A function is a rule that assigns to each number t another number, which we
can call y. There are many ways to establish this link. For example, we could
simply say that the function assigns to each rational number a 1 and to each
irrational number a 0. However, in most calculus-related cases, functions are
usually expressed by formulae. In the case of the formula y = 20t − 5t², the
function assigns to each number t a number y that is obtained by replacing t in
that expression. For example, for t = 1, the function assigns the number
y = 20·1 − 5·1² = 15.

But what is the use of a formula like the one just described? We have already touched
on the topic when we discussed the growth of bacterial populations or the decay of
radioactive substances in chapter 2. A function serves to describe quantitatively a
great range of phenomena and, above all, to predict their future evolution.
GOTTFRIED WILHELM LEIBNIZ, THE AMBITION OF GLOBAL KNOWLEDGE


Gottfried Wilhelm Leibniz was born in Leipzig, Saxony, in present-day Germany, on
1 July 1646. In 1661, at the age of 14, he entered the University of Leipzig, where
he studied philosophy and mathematics. Throughout his life, Leibniz pursued the
difficult goal of collecting all human knowledge and, for that reason, he also
studied law (gaining a doctorate in this field in 1667), as well as physics, Latin,
Greek and Hebrew.

Leibniz published his first work on differential calculus in 1684 under the title
New Method for Maxima and Minima, a text containing all the essential rules of
calculus but which did not justify the validity of those rules. For this reason,
Jacob Bernoulli commented that Leibniz's work was “more of an enigma than an
explanation”.

Newton, who had developed the calculus process in 1671 but did not publish it until 
1687, accused Leibniz of having stolen his ideas. This sparked a long dispute between 
them, in which many mathematicians got involved. For Leibniz, this struggle demanded 
more and more of his time, and he spent the final years of his life embittered. Today, 
however, it is unreservedly accepted that the two men developed calculus contempora- 
neously and independently. 


Leibniz died in Hanover, now in Germany, on 14 November 1716. 


Let’s check this idea with the formula y = 20t − 5t², where we assume that t can
only take values between 0 and 4. What physical phenomenon can be described
by this function? One possible answer is: imagine that, using a mechanism similar
to a compressed air cannon, we shoot a ball straight upwards. The ball, of course,
will reach a certain maximum height and then fall back again. If the initial shot has
the appropriate force, then the function y = 20t − 5t² may describe the height that
the ball reaches during its movement. In other words, at each instant t of time
(in this case, measured in seconds), the formula assigns the height at which
the ball is at that moment, which we express in metres. For example, at t = 0, the
formula assigns the value y = 20·0 − 5·0² = 0, which means that at the initial time,
the ball is at ground level. On the other hand, at t = 1, the formula assigns the value
y = 20·1 − 5·1² = 15, that is, exactly one second after being thrown, the ball is 15
metres high.
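As a minimal sketch, the formula can be evaluated at several instants in Python. The function name `height` is chosen here purely for illustration.

```python
def height(t):
    """Height in metres of the ball t seconds after launch (0 <= t <= 4)."""
    return 20 * t - 5 * t ** 2


# Evaluate the formula at each whole second of the flight.
for t in (0, 1, 2, 3, 4):
    print(t, height(t))
```

The first two values reproduce the ones computed above: 0 metres at t = 0 and 15 metres at t = 1.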

What else does this formula tell us about the movement of the ball? To answer 
this question, we will recall that each function has a curve associated with it — 
which is called its graph. This curve is formed by all points in the plane, in which 
the second coordinate results from applying the function to the first coordinate. In 
the case of y = 20t − 5t², as the function assigns the value y = 15 at t = 1, the
coordinate point (1, 15) belongs to the graph. Shown below is the full graph of the
function. The horizontal axis shows the possible values of t (we have established
that they are between 0 and 4) and the vertical axis shows the possible values of y.


Graph of the function y = 20t − 5t²; on it, we have marked the coordinate point (1, 15).


The graph helps us analyse what the function ‘tells us’ about the movement 
of the ball. We see that, as t increases between 0 and 2, the curve goes ‘up’, which 
means that after being thrown in the time interval between 0 and 2 seconds, the 
ball rises to its maximum height. This happens at t = 2, i.e. two seconds after firing. 


As t = 2 corresponds to y = 20, that maximum height is 20 metres. 
[Figure: graph of y = 20t − 5t² with the highest point of the curve, (2, 20), marked.]


For t between 0 and 2 the curve ‘goes up’, for t between 2 and 4 the curve ‘goes down’. 
At t = 2, the curve reaches its maximum height. 


For t between 2 and 4 the curve ‘goes down’: after reaching its maximum
height, the ball, logically, begins to descend. And given that t = 4 corresponds to
the value 20·4 − 5·4² = 80 − 80 = 0, 4 seconds after it has been thrown, the
ball returns to the ground. This is why t only takes values between 0 and 4:
beyond the four-second mark, it no longer makes sense to continue
describing the movement.

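The following table summarises the description given by the function y = 20t − 5t² of
the movement of the ball:

Values of t | The curve | The ball
0 < t < 2 | goes up | rises
t = 2 | reaches its highest point (y = 20) | is at its maximum height, 20 metres
2 < t < 4 | goes down | descends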


Let’s now suppose that the function y = 30 + t − t² describes the ambient
temperature of a city X, measured in degrees centigrade, from noon to 3 pm on a
particular day last summer. Time t indicates the number of hours elapsed since
noon, so t varies between 0 and 3. Thus, for example, for t = 0 the function gives
the ambient temperature at 12 noon, which is 30°C because y = 30 + 0 − 0² = 30;
for t = 2 we get the temperature at 2 pm, which is 28°C because
y = 30 + 2 − 2² = 28.

Based on the analysis of the previous example we can ask in what time interval 
between 12 pm and 3 pm has the temperature increased or decreased, or what the 
maximum and minimum temperatures were. To know this, it would be very use- 
ful to draw a graph of the function, but how is it drawn? In reality, nowadays, the 
graphical representation of any formula is produced in fractions of a second thanks 
to computer programs that can be downloaded legally and for free. However, these 
resources did not exist until the second half of the 20th century. Before this time, 
they were produced ‘by hand with pencil and paper’. But how did they manage to 
analyse the growth or decrease of a function before the computer age, or to calcu- 
late, for example, the points at which a function reaches its maximum or minimum? 
This was done thanks to one of the fundamental concepts of calculus, developed by
Isaac Newton and Gottfried Leibniz. We refer to the derivative, which measures the
rate of change of a function, that is, how quickly the function is changing at a
given point.


The derivative: a concept designed to find the limits 


The derivative of a function y is another function, often called y', which calcu- 
lates the rate of growth or decrease of y. For example, if, as we have discussed, 
y represents the height of a ball being thrown upwards, then y' indicates the 
velocity at which that ball is moving. In the second example, where y is the 
ambient temperature, y' calculates the rate of change of temperature. If the rate 
is high, the temperature changes very quickly, if the rate is low, it does so very 
slowly. 

If y is given by a formula, then y' will also have a formula. But how do you
get the derivative of y? What are the rules for calculating it? Newton and Leibniz
were the first to find them. Below we set out some of these rules, beginning with
constant functions.

Let’s suppose that y is a constant function, which always assigns the same number
to all values of t. Such a function may be useful, for example, in measuring
the temperature of boiling water. While the water boils, its temperature remains
steady at 100°C (we assume that it is distilled water under normal atmospheric
pressure). Therefore, if we define y as the function that assigns to each instant t
the temperature that the boiling water has at that moment, then the function is
defined as y = 100. In other words, to any value of t, the function assigns the
value 100. The following figure shows its graph, assuming that t varies between 0
and 10 (which implies that the water boils for 10 minutes).


A constant function that assigns each value of t, between 0 and 10, the value 100. 
The graph is a horizontal line. 


The main rules for finding derivatives 


A constant function does not grow or decrease per se. Its rate of growth or decrease 


is 0, which, in terms of the derivative, translates to the following rule: 
Rule 1: If y is constant, then y'=0. 
Let us now take the function y = 2t, which assigns each number its double.
What is its rate of growth? To find out, we set out the following table, indicating
different values of t together with the numbers assigned by that function:

t | 0 | 1 | 2 | 3
y = 2t | 0 | 2 | 4 | 6
ISAAC NEWTON, ONE OF THE GREATEST


Isaac Newton was born in Woolsthorpe, Lincolnshire, England, on Christmas Day 1642,
according to the Julian calendar. At that time, England, unlike most of the rest of
Europe, had not yet adopted the Gregorian calendar, currently used throughout the
world, and would not do so until 1752. Thus, translated into the modern calendar,
Isaac Newton's birth date is 4 January 1643.

Along with Gauss and Archimedes, Newton is considered one of the three greatest
mathematicians of all time. His greatest contribution to this science was the
creation of calculus and its application to the solution of a vast number of
problems. Nevertheless, it was his work in physics that inscribed his name in gold
letters in the annals of science. His work Mathematical Principles of Natural
Philosophy (better known as the Principia) is considered the greatest individual
scientific work in history. It covers three fundamental themes: the three laws of
motion, the law of universal gravitation and valuable contributions to the
understanding of the structure of the Solar System.

The scientist died in London on 31 March 1727 according to the Gregorian calendar and was
buried in Westminster Abbey, where a monument to Newton stands. The monument's
base contains a substantial epitaph. Here is an excerpt: “A Knight, who by a strength of mind
almost divine, and mathematical principles peculiarly his own, explored the course and figures
of the planets [...] Diligent, sagacious and faithful, in his expositions of nature [...] Mortals
rejoice, that there has existed such [...]”


Let us imagine that t, as in the previous examples, represents time, measured in
seconds. The above table shows that when the time increases from 0 to 1, the value 
of y increases by two units, from 0 to 2, and when the time increases from 1 to 2, 
the value of y increases two more units, from 2 to 4. 

In summary, after every second, the function always increases by two units, so 
we can state that the rate of change of y is constant and equal to 2. In terms of 
the derivative, this tells us that if y=2t then y'=2. This fact is generalised in the 


following rule: 
Rule 2: If y = Ct, where C is any number, then y' = C.


Let’s now take the function y = t². This formula could describe the distance
that a stone descends vertically. We will set this example on a hypothetical
planet with less gravity than Earth, we will ignore air resistance, and we will measure
the time in seconds and the distance in metres. We refer to the distance travelled
by the stone and not to the height at which it is. They are different questions
because when the stone falls, the value of the height decreases while the distance
travelled increases.

In both the case of the boiling water and the function y=2t, the rate of change 
of the function was either 0 or constant, which greatly simplifies the calculation 
of the derivative. However, in this new example, the rate of change varies at every 
instant, because the stone moves faster and faster. Therefore, calculation of the de- 
rivative is more complicated. 

Before we begin to calculate, for example, what is the velocity of the stone just 
3 seconds after being released, we will define what the velocity is. In the case of a 
car that takes 2 hours to travel a distance of 160 kilometres, we could say that the 
vehicle travelled at 80km/h, as the velocity is the result of comparing the distance 
travelled with the time it has taken to travel that distance: 


$$\text{Velocity} = \frac{160\ \text{km}}{2\ \text{h}} = 80\ \text{km/h}$$


Generalising this relationship, we could establish that: 


$$\text{Velocity} = \frac{\text{Distance travelled}}{\text{Elapsed time}}$$

But what if the car has not travelled at the same velocity all the time? Maybe 
at the beginning of the trip, it was travelling faster and then slowed down, or vice 
versa. Therefore, the relationship 


$$\text{Velocity} = \frac{160\ \text{km}}{2\ \text{h}} = 80\ \text{km/h}$$

gives us an average velocity, but does not allow us to know what the velocity of 
the car is at every instant. How could we know what the velocity was 20 minutes 
after it had begun its journey? Or, returning to the previous example, how can we 


calculate the velocity of the stone at time t = 3? It is not about getting the average
velocity over an entire time interval, but about finding the exact velocity at a given
time. How can we link the expression that seeks to find the velocity over an elapsed 
time, with the calculation of the velocity at a single instant, in which the ‘elapsed 
time’ is 0? 

The way to do this is to first calculate the velocity over the interval from, for
example, 3 to 3.1 seconds, then over the interval between 3 and 3.01 seconds,
then between 3 and 3.001 seconds and so on. That is, we will calculate the
velocity over intervals of decreasing length, until the elapsed time shrinks
towards 0 and the interval is reduced to a single instant. Following this idea, the
table below shows the velocities of the stone for time intervals that are ever
‘closer’ to the ‘3 seconds’ instant.


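Initial instant | Final instant | Time elapsed | Position at the initial instant | Position at the final instant | Distance travelled | Velocity in this interval
3 s | 3.1 s | 0.1 s | 9 m | 9.61 m | 0.61 m | 6.1 m/s
3 s | 3.01 s | 0.01 s | 9 m | 9.0601 m | 0.0601 m | 6.01 m/s
3 s | 3.001 s | 0.001 s | 9 m | 9.006001 m | 0.006001 m | 6.001 m/s
3 s | 3.0001 s | 0.0001 s | 9 m | 9.00060001 m | 0.00060001 m | 6.0001 m/s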


Velocity = distance travelled / Elapsed time. 
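The calculation over shrinking intervals can be reproduced with a short Python sketch. The names `distance` and `average_velocity` are introduced here for illustration, and the printed values are only approximately 6.1, 6.01, 6.001 and 6.0001, because the computer works with rounded decimals.

```python
def distance(t):
    """Distance in metres fallen after t seconds on the hypothetical planet."""
    return t ** 2


def average_velocity(start, elapsed):
    """Distance travelled divided by elapsed time over [start, start + elapsed]."""
    return (distance(start + elapsed) - distance(start)) / elapsed


# Shrink the interval that starts at the '3 seconds' instant.
for elapsed in (0.1, 0.01, 0.001, 0.0001):
    print(elapsed, average_velocity(3, elapsed))
```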


If we look at the last column of the table, we see that as the time interval
shrinks, the calculated velocity approaches 6 m/s. In fact, it can be shown
mathematically that this value, 6 m/s, is exactly the velocity of the stone 3 seconds
after being released. The reasoning we have just outlined can be generalised to show
that if y = t², then y' = 2t (note that for t = 3, the value that is obtained is 6).

The above reasoning is summarised in rule 3:
Rule 3: If y = Ct², where C is any number, then y' = 2Ct.
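The claim that the limiting value is exactly 6 can be checked with a short computation. Writing h for the elapsed time (a symbol introduced here only for illustration), the average velocity of the stone between the instants 3 and 3 + h is:

```latex
\frac{(3+h)^2 - 3^2}{h} = \frac{9 + 6h + h^2 - 9}{h} = \frac{6h + h^2}{h} = 6 + h
```

As h shrinks towards 0, the value 6 + h approaches 6. Repeating the same computation at an arbitrary instant t gives 2t + h, which approaches 2t.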
If the stone were dropped on Earth (and not on a planet with lower gravity),
then the function that calculates the distance travelled at any instant would have
to reflect the Earth's stronger gravity. Ignoring air resistance, this function is
y = 5t² (in fact, it would be more accurate to write y = 4.9t², but rounding up to 5
is accepted) and its derivative, according to the rule indicated above, is
y' = 2·5t = 10t.
In case we encounter higher powers of t, a development similar to the one we
have just shown allows us to deduce:


Rule 4: If y = Ct³, then y' = 3Ct².
Rule 5: If y = Ct⁴, then y' = 4Ct³.
Rule 6: If y = Ct⁵, then y' = 5Ct⁴.


And so on... 
But let us now return to the case of the ball that is thrown vertically upwards 


and to the temperature on a certain day of the previous summer and analyse both 


examples using the ‘mathematical tool’ that we have just introduced. 


OTHER RULES FOR FINDING DERIVATIVES 


In the text, we have only shown some of the many differentiation rules in existence.
One of the rules not mentioned is the one that refers to the product of two
functions. It says that if y₁ and y₂ are two functions, then the derivative of their
product is calculated as:

$$(y_1 y_2)' = y_1 (y_2)' + (y_1)' y_2$$

Thus, given that for the function y = t we have y' = 1, we can deduce in a different
way the derivative of y = t². Indeed:

$$(t^2)' = (t \cdot t)' = t \cdot (t)' + (t)' \cdot t = t \cdot 1 + 1 \cdot t = t + t = 2t$$

Another rule tells us that the derivative of the addition (or subtraction) of two functions is
the addition (or subtraction) of their respective derivatives. For example, the derivative of
y = t² + t is y' = 2t + 1.


We recall that the formula that calculates the height of the ball at each of
the moments between 0 and 4 seconds following its launch is y = 20t − 5t², and
that the graph of the function showed us that, after being thrown, the ball went
up during the next two seconds, at which point it reached its maximum
height and started its descent until it reached the ground again. How is this
sequence of events reflected in the derivative?

Let us begin by observing that, according to rule 2, the derivative of 20t is 20
and that, according to rule 3, the derivative of 5t² is 10t. Thus, for y = 20t − 5t², the
derivative is y' = 20 − 10t.


y = 20t − 5t²


For all values of t between 0 and 2, the formula y' = 20 − 10t always gives a
positive value. For example, for t = 1 the derivative is 10, which indicates that the
velocity of the ball at that time was 10 m/s. When the derivative is positive, the
function is increasing, which means that, as we saw in the graph, the ball goes up.

However, at t = 2, the derivative is 20 − 10·2 = 0, and this tells us that the velocity
at that instant is 0. Indeed, when the ball reaches its maximum height, it stops for
an instant and then begins to fall.

Finally, for all values of t between 2 and 4, the derivative is negative. For
example, for t = 3, the derivative is 20 − 10·3 = −10 and therefore the velocity of the
ball at that moment is −10 m/s. The negative sign indicates that the function is
decreasing and that throughout this interval, the ball is descending. The following
table summarises this reasoning:
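The same reasoning can be tabulated with a small Python sketch. The names `velocity` and `describe`, and the sample instants, are choices made here for illustration.

```python
def velocity(t):
    """Derivative of y = 20t - 5t^2: velocity of the ball in m/s."""
    return 20 - 10 * t


def describe(t):
    """Translate the sign of the derivative into the movement of the ball."""
    v = velocity(t)
    if v > 0:
        return "rising"
    if v < 0:
        return "descending"
    return "at maximum height"


# One row per sample instant: time, derivative, behaviour of the ball.
for t in (0.5, 1, 1.5, 2, 2.5, 3, 3.5):
    print(t, velocity(t), describe(t))
```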


What can we say about the example of the temperature measured between
noon and 3 in the afternoon? We recall that in this case, the formula is y = 30 + t − t²
and its derivative is y' = 1 − 2t (the derivative of 30 is 0 according to rule 1, the
derivative of t is 1 according to rule 2 and that of t² is 2t according to rule 3).

In the example of the ball, it reaches its maximum height at the moment
the derivative is 0. In the case of temperature, its maximum value is likewise
reached when the derivative is equal to 0, which happens when t = 0.5 (for that
value of t, y' = 1 − 2·0.5 = 0). In other words, in the period between noon and 3 pm,
the highest temperature was recorded at 12.30 pm (0.5 hours after noon) and was
y = 30 + 0.5 − 0.5² = 30.25°C.

When the value of t is between 0 and 0.5, the derivative is positive. For example,
for t = 0.1, y' = 1 − 2·0.1 = 0.8. This tells us that during this interval, the
temperature was increasing. As 0.1 hours equals 6 minutes, we can deduce that, exactly
6 minutes after noon, the temperature was increasing at a rate of 0.8°C per hour.
On the contrary, for t between 0.5 and 3, the derivative is negative, which tells us
that in this time interval, the temperature was decreasing:
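A table of this kind can be generated with a few lines of Python. The quarter-hour step and the names `temperature` and `rate` are assumptions made for this sketch.

```python
def temperature(t):
    """Ambient temperature in degrees centigrade, t hours after noon."""
    return 30 + t - t ** 2


def rate(t):
    """Derivative y' = 1 - 2t, in degrees centigrade per hour."""
    return 1 - 2 * t


# Sample the interval from noon to 3 pm every quarter of an hour.
times = [step * 0.25 for step in range(13)]
for t in times:
    print(t, temperature(t), rate(t))

# The sampled instant with the highest temperature.
peak = max(times, key=temperature)
print(peak, temperature(peak))  # 0.5 30.25
```

The peak found by scanning the samples agrees with the instant at which the derivative is 0.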


The following figure shows the graph of the temperature function, which can 


be plotted from the information in the table. 


[Figure: graph of the temperature function with the maximum point of the curve marked.]
Graph of the function y = 30 + t − t² with t between 0 and 3.




Although the two phenomena we have analysed are of a completely different
nature (one is about throwing a ball and the other about ambient temperature), the
mathematical analysis of both has been very similar. This similarity reveals the true
power of mathematics in general and of calculus in particular. Once the rules for
mathematical operations are known, for example, the rules for differentiating
functions, they serve to study any kind of phenomenon, no matter its nature,
provided it can be described mathematically.


The exponential function: link with the number e 


Now we will take a step forwards and see how all of the above is related to the
number e. For this, we shall consider the exponential function y = e^t, where t can be
replaced by any real number, and we will calculate the derivative of this function.

We recall that, as we saw in chapter 2, this function can be written as follows:


$$e^t = \frac{t^0}{0!} + \frac{t^1}{1!} + \frac{t^2}{2!} + \frac{t^3}{3!} + \frac{t^4}{4!} + \frac{t^5}{5!} + \cdots$$


And that can also be expressed as: 


$$e^t = 1 + t + \frac{t^2}{2} + \frac{t^3}{6} + \frac{t^4}{24} + \frac{t^5}{120} + \cdots$$


If we successively apply all the previous rules, we obtain: 


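$$\begin{aligned}
y' &= 0 + 1 + \frac{2t}{2} + \frac{3t^2}{6} + \frac{4t^3}{24} + \frac{5t^4}{120} + \cdots \\
   &= 1 + t + \frac{t^2}{2} + \frac{t^3}{6} + \frac{t^4}{24} + \cdots \\
   &= e^t
\end{aligned}$$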


That is, the derivative of y = e^t is y' = e^t. In other words, the derivative of the
exponential function is the function itself.
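The term-by-term calculation can be checked with exact fractions in Python. This is a sketch on a truncated series: the helper names `exp_coefficients` and `differentiate` are hypothetical, and cutting the series off at t⁵ is a choice made only for the example.

```python
from fractions import Fraction
from math import factorial


def exp_coefficients(n):
    """Coefficients 1/0!, 1/1!, ..., 1/n! of the series for e^t, cut off at t^n."""
    return [Fraction(1, factorial(k)) for k in range(n + 1)]


def differentiate(coefficients):
    """Apply the power rule to each term: C*t^k becomes k*C*t^(k-1)."""
    return [k * c for k, c in enumerate(coefficients)][1:]


# Differentiating the terms up to t^5 gives back the terms up to t^4.
print(differentiate(exp_coefficients(5)) == exp_coefficients(4))  # True
```

Each differentiated term reproduces the previous term of the series, which is why the infinite sum is its own derivative.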
What are the consequences of this fact? On the one hand, e^t is always a positive
number and, consequently, y' is also positive, so the function is growing all the time.
Moreover, growth feeds back on itself because the function and the derivative coincide. The
higher the value of the function, the greater the growth rate. In turn, this causes
an increase in the function, which further speeds up the growth rate, and so on.
This feedback explains why the exponential function grows, as discussed in chapter
2, in an irrepressibly ‘explosive’ manner. The figure below shows the graph of this
function:


Graph of the function y = e^t.


The function y = e^t is essentially the only one that coincides with its derivative: all
the others are only variants of it, namely those of the form y = Ce^t, where C is any
number, such as y = 2e^t, y = −3e^t or y = 0.2e^t.

In chapter 2, we gave the name exponentials to a broader type of function, those of
type y = Ce^(kt), where k and C are constants. But what about the derivatives of these
functions?

A similar reasoning to that of y = e^t allows us to show that if y = Ce^(kt), then its
derivative has the same formula, but multiplied by the constant k. That is, for
y = Ce^(kt), it follows that y' = kCe^(kt) = ky. In fact, the only functions that satisfy the
equation y' = ky are those of the form y = Ce^(kt). Thus, for example, the only functions
that satisfy the equation y' = 3y are those of the type y = Ce^(3t) (among them, y = 2e^(3t),
y = 7e^(3t) or y = −3e^(3t)).
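The relation y' = ky can be tested numerically. The sketch below estimates the derivative with a small symmetric difference, a standard numerical approximation that is not part of the argument in the text. The helper names and the step `h` are assumptions.

```python
import math


def exponential(C, k):
    """Return the function y(t) = C * e^(k*t)."""
    return lambda t: C * math.exp(k * t)


def rate_of_change(f, t, h=1e-6):
    """Estimate f'(t) from the values of f just before and just after t."""
    return (f(t + h) - f(t - h)) / (2 * h)


y = exponential(2, 3)  # the function y = 2e^(3t)
for t in (0, 0.5, 1):
    # The two printed numbers should agree to several decimal places.
    print(t, rate_of_change(y, t), 3 * y(t))
```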
JUST A CURIOSITY?


The formula A = πr² is a function that assigns to each positive value of r the area of the circle
with that radius. Its derivative is A' = 2πr, which is the function that calculates the length
of the circumference of radius r. Is this a rare coincidence? Well, no. There is a compelling
reason why ‘the derivative of the circle’s area is its perimeter’. Let’s see what it is.


Imagine a circle formed by an infinite number of concentric ‘glued together’ circumferences.
This idea tells us that as the radius increases, new circumferences are added to the area of
the circle. Therefore, the rate at which the area is increasing is given by what these added
circumferences measure. In other words, as we wanted to show, the derivative of the area is
given by the length of the circumference.

The same happens with the square in the image, where r is half the length of its side. In this
case, the area is A = (2r)² = 4r², whilst its derivative, A' = 8r, is the perimeter of the figure.
The same thing happens with any regular polygon, when r is the length of its apothem
(the shortest distance between the centre of a polygon and any of its sides).

We can also imagine a sphere formed by an infinite number of concentric spherical surfaces
‘glued one to another’ and therefore, according to the reasoning just discussed, the
derivative of the volume of a sphere should be equal to the area of its surface. And, in fact, it is.
The volume of the sphere is calculated as V = (4/3)πr³, and its derivative is V' = 4πr², which
calculates the area of its surface.


This fact allows us to analyse from a new point of view the example of the
growth of bacterial populations. We stated that, given unlimited living
space and food, bacterial populations increase at a rate that is, at any
moment, a fixed percentage of the population present at that time. Let’s say that, as on
that occasion, that percentage is 20% and y is the function that at every instant
gives us the quantity of bacteria present. The fact that the population growth rate is
always 20% is mathematically expressed as:

$$y' = 0.2y$$


Now, as we said above, the only functions that fulfil this relation are of the
form y = Ce^(0.2t). In the case of bacteria, the value of C can
only be the initial quantity of bacteria, because t = 0 corresponds to
y = Ce^(0.2·0) = Ce⁰ = C. In other words, the equation y' = 0.2y tells us that the quantity
of bacteria under infinite resource conditions can only be described by a single
exponential function.
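One way to see the equation at work is to let a computer follow it in many small steps, a procedure usually called Euler's method, which the text does not use. In this sketch the initial population of 1000, the duration of 10 time units and the number of steps are all hypothetical values chosen for illustration.

```python
import math


def euler_growth(initial, rate, duration, steps):
    """Approximate the solution of y' = rate * y by taking many small steps."""
    y = initial
    dt = duration / steps
    for _ in range(steps):
        y += rate * y * dt  # during dt the quantity grows by about rate * y * dt
    return y


initial = 1000  # hypothetical initial quantity of bacteria
approx = euler_growth(initial, 0.2, 10, 100_000)
exact = initial * math.exp(0.2 * 10)  # y = C * e^(0.2t) with C = 1000, t = 10
print(approx, exact)
```

The two printed values are close to each other, and they get closer as the number of steps increases, which is consistent with the exponential function being the solution.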

On the other hand, the equation y'=0.2y also represents the evolution of an in- 
vested capital with 20% compound interest, as well as any other quantity that grows 
at a constant rate of 20%. In all cases, the function describing the phenomenon is 


necessarily exponential. 


Differential equations: physics expressed mathematically 


An equation such as y'=0.2y, which expresses the relationship between a 
function and its derivative is called a differential equation. Differential equations 
are an essential tool for translating into mathematical language the laws of 
physics, biology, chemistry, etc., as almost all such laws (as in the case of bac- 
teria) are expressed by the relationship that exists between a quantity and its 
rate of growth. 

It is important to mention that some of these natural laws express the relation- 
ship that exists between a quantity and its acceleration — the rate at which velocity varies. 
We note that if y' is the rate of change of y, then the variation of y' (the variation 
of its velocity, i.e. acceleration) is (y')', which is usually written y" and is called the 
second derivative of y. 

For example, from Newton’s laws, it follows that if a stone is dropped and air
resistance is not taken into account, it will fall with constant acceleration. On Earth,
that acceleration is approximately g = 10 m/s² (more precisely, it is 9.8 m/s², but it is
acceptable to round this up to 10). If y is the distance that the stone travels,
the fact that the acceleration is constantly equal to g is expressed mathematically by
the following differential equation:

$$y'' = g$$
If we assume that at the moment the stone is released, it has not yet travelled
any distance and if we simply drop it without giving it any additional impulse,
the differential equation makes it possible to deduce that the distance travelled is

$$y = \tfrac{1}{2} g t^2 = 5t^2,$$


A PARACHUTIST FALLING 


We have discussed how to mathematically describe the fall of an object without taking into
account air resistance, which is reasonable when it comes to a stone falling. But if the fall in
question is that of a parachutist, then air resistance happens to be a factor that cannot be
ignored. In this case, the differential equation satisfied by the velocity at which the
parachutist falls (which we will call v) is:

$$v' = g - kv$$

Here, k is the coefficient for air friction, so the subtracted term represents the braking effect
(countering the acceleration) produced by air resistance and the parachute. In this case, it can
be proved that the velocity of the parachutist at each instant t is:

$$v = \frac{g}{k}\left(1 - e^{-kt}\right) + v_0 e^{-kt},$$

where v₀ is the velocity at the moment the parachute is opened.
which coincides with what we have set out in one of the previous examples. (If the
planet has a lower gravity, so that the acceleration due to gravity on its surface is 2
m/s², then the distance travelled by a stone that is dropped on its surface would be
given by the formula y = t², as we stated earlier.)
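The step from the differential equation to the formula can be sketched by undoing the derivative twice. The constants v₀ and y₀ (the initial velocity and the distance already travelled at t = 0) are names introduced here for illustration.

```latex
y'' = g
\;\Longrightarrow\;
y' = g t + v_0
\;\Longrightarrow\;
y = \tfrac{1}{2} g t^2 + v_0 t + y_0
```

Since the stone is simply dropped, v₀ = 0, and since it has not yet travelled any distance, y₀ = 0, which leaves y = ½gt². With g = 10 this gives y = 5t², and with g = 2 it gives y = t².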

The mathematical consequence of the fact that functions of type y = Ce^(kt) are,
up to a constant factor, equal to their own derivatives is that they appear very
frequently as part of the solution of many differential equations. This is the reason
that the exponential function (and consequently, the number e) appears in the
description of many natural phenomena. As the mathematicians Richard Courant and
Herbert Robbins state:


The natural exponential function [referring to y = e^t] is identical to its
derivative. This is the source of all the properties of the exponential function and the
basic reason for its importance in applications.


There is a problem known as the ‘catenary problem’, which can shed light on
the issue. It consists of determining what shape a rope takes when it is held at its
two ends. This might be a high voltage cable, a chain or a catenary, which gives its
name to the problem, as the word catenary comes from catena, which is ‘chain’ in Latin.


The catenary problem: 
which curve corresponds to the shape of a string supported from two points? 


This problem was raised several times throughout history and was studied
by, amongst others, Leonardo da Vinci (1452-1519) and Galileo Galilei (1564-1642),
but it proved impossible to solve: the mathematical tools necessary for this were not
developed until the 17th century.
It was Jacob Bernoulli who, in 1690, suggested that the concepts of calculus
developed by Newton and Leibniz might provide a solution to the catenary
problem. And so it was: the following year, Gottfried Leibniz, Christiaan
Huygens and Johann Bernoulli were able to find the answer independently.
They deduced that if y is the function whose graph corresponds to
the desired curve, then this function has to fulfil the following relationship,
which arises from translating into mathematical terms the forces acting
on the string:


In this equation, a is a constant that depends on the physical parameters of the
string, cable or chain in question. From it, it follows that the desired curve is the
graph of the following function:


A formula in which, once again, the number e is involved. This figure shows the
graph of the function for different values of the constant a.


Solution to the catenary problem. 
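
As a hedged illustration, suppose the catenary is written in one common textbook form, y = (a/2)(e^(x/a) + e^(-x/a)), and that the condition it satisfies is y'' = (1/a)·sqrt(1 + (y')²). Both forms are assumptions made for this sketch; the parametrisation used in the book may differ. The code estimates the derivatives by finite differences and checks that the two sides of the equation agree.

```python
import math

def catenary(x, a):
    """Assumed form of the catenary: y = (a/2)(e^(x/a) + e^(-x/a))."""
    return (a / 2.0) * (math.exp(x / a) + math.exp(-x / a))

def derivatives(f, x, h=1e-4):
    """Central finite-difference estimates of f'(x) and f''(x)."""
    first = (f(x + h) - f(x - h)) / (2 * h)
    second = (f(x + h) - 2 * f(x) + f(x - h)) / (h * h)
    return first, second

def residual(x, a):
    """y'' - sqrt(1 + y'^2)/a, which should be close to 0 on the curve."""
    first, second = derivatives(lambda u: catenary(u, a), x)
    return second - math.sqrt(1.0 + first * first) / a

for a in (0.5, 1.0, 2.0):
    print(a, catenary(0.0, a), residual(0.7, a))
```

In this parametrisation the lowest point of the curve sits at height a, so changing a changes how tightly the cable hangs.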
How is the area calculated?


Let us imagine that we are given a function whose graph, as shown in the following
figure, is always above the horizontal axis. What will we do if we want to calculate
the area under the graph for all values between t = a and t = b?


For example, the problem asks us to calculate the area of the grey-shaded region. In this case, the
curve corresponds to the function $y = e^t$.


Thanks to one of the fundamental discoveries of mathematical analysis, we
know that ‘area calculation’ is essentially the inverse operation of ‘calculating the
rate of change’. As we have already seen, the derivative of the function $y = t^3$ is
$y' = 3t^2$, which means that the area under the graph of $y' = 3t^2$ is calculated by the
function $y = t^3$.


[Diagram: calculating the rate of change leads from $y = t^3$ to $y' = 3t^2$; calculating
the area under the curve leads from $y' = 3t^2$ back to $y = t^3$.]


Therefore, if we wanted to calculate the area of the region under the graph
of $y = 3t^2$, between 1 and 2, as shown in the figure below, we would subtract the
value of $y = t^3$ at t = 1 from the value at t = 2. Consequently, the area sought is
$2^3 - 1^3 = 8 - 1 = 7$.


Area of the region under the graph of $y = 3t^2$, between 1 and 2.
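
The subtraction 2³ - 1³ = 7 can be compared with a direct numerical estimate of the area. The sketch below is a minimal illustration, not a method taken from the book: it approximates the area under y = 3t² between 1 and 2 with many thin rectangles (the midpoint rule). The function name midpoint_area and the number of rectangles are assumptions made for this example.

```python
def midpoint_area(f, a, b, n=100_000):
    """Approximate the area under f between a and b with n thin rectangles."""
    width = (b - a) / n
    return sum(f(a + (i + 0.5) * width) for i in range(n)) * width

# Area under y = 3t^2 between t = 1 and t = 2, estimated numerically.
area_numeric = midpoint_area(lambda t: 3 * t**2, 1.0, 2.0)

# The same area obtained from the function y = t^3, as in the text.
area_exact = 2**3 - 1**3

print(area_numeric, area_exact)
```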


In chapter 1, we saw that before Newton and Leibniz developed their works,
Christiaan Huygens had already determined that the natural logarithm is the function
that allows us to calculate the area under a hyperbola of the equation $y = \frac{1}{t}$, as
shown below. Now, we can say that this is due to the fact that the derivative of
$y = \ln(t)$ is $y' = \frac{1}{t}$.


Area of the region under the graph of $y = \frac{1}{t}$.


As the exponential function $y = e^t$ is equal to its own derivative, this allows the
area under the curve to be calculated. For example, the area of the region between
0 and 1 is $e^1 - e^0 = e - 1$:


Area of the region under the graph of $y = e^t$ between 0 and 1.


This fact allows us to reach a surprising conclusion. Suppose we want to calculate
the area under the curve $y = e^t$ from ‘minus infinity’ to any value a, as shown in the
following figure. As stated above, this area is the value of the function $y = e^t$ at a
minus its value at ‘minus infinity’. As t takes on values that are ‘more negative’ each
time (each time further ‘to the left’ on the horizontal axis), the graph itself shows
that the function approaches 0, so its value at ‘minus infinity’ is 0. Therefore, the area
sought is $e^a - 0 = e^a$. We conclude that, although the region is unlimited, because it
extends to the left ‘without stopping’, the area it occupies is finite.


Area of the region under the graph of $y = e^t$ between ‘minus infinity’ and any number a.
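
The finite area of this unlimited region can be made plausible numerically. The following sketch, an illustration under stated assumptions rather than the book's own calculation, estimates the area under y = e^t from an increasingly negative starting point up to a = 1 (an arbitrary choice) and compares it with e^a. The helper midpoint_area is a hypothetical name introduced here.

```python
import math

def midpoint_area(f, a, b, n=200_000):
    """Approximate the area under f between a and b with n thin rectangles."""
    width = (b - a) / n
    return sum(f(a + (i + 0.5) * width) for i in range(n)) * width

a = 1.0  # assumed right-hand end of the region
for lower in (-5.0, -10.0, -20.0, -40.0):
    estimate = midpoint_area(math.exp, lower, a)
    # Exact area between `lower` and a is e^a - e^lower, which tends to e^a.
    print(lower, estimate, math.exp(a) - math.exp(lower))

print("e^a =", math.exp(a))
```

Pushing the starting point further to the left barely changes the estimate, because the extra strip being added has an area of only e^lower.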
Created or discovered, the number e is omnipresent


Throughout this book, we have seen how the number e appears in the description 
of numerous natural phenomena. Is this ubiquity of the number e a property of 
phenomena per se, or does it only refer to the language we use to describe them? 

For example, we can say that the ocean is ‘cold’ and that it is ‘deep’ and marvel
at the fact that the letter ‘d’ appears in both words. However, this appearance
is obviously a linguistic coincidence that is not related to the nature of the ocean
itself. Is that idea applicable to the ubiquity of e? That is, is the number e an inherent
property of phenomena, or is it the result of the mathematical language we use to
describe them?

The question is not a trivial one; it is related to a fundamental philosophical
question: is mathematics created or discovered? In other words, are mathematical
concepts a human creation, like language? Or do they exist independently and are
constantly being discovered, just as helium, which has obviously existed since time
immemorial, was discovered in 1868?

In this book, we have taken the position that mathematical concepts are
discovered and that in some way the number e already existed before being ‘found’ by
Jobst Bürgi and Jacob Bernoulli. However, the opposite position is also defensible,
and the truth is that even today, philosophers of mathematics have not yet agreed on
which of the two positions is the ‘correct’ one to take.

But, whether it exists by itself or whether it is born out of how we describe
natural phenomena, the fact is that the number e is essential to our knowledge of
reality as we see, understand and transmit it.

Without this extraordinary number, much of the work of many scientists
and engineers would not be possible. That is why every time you notice how a
high-voltage cable hangs from its two ends, or you calculate the interest accrued
on a bank deposit, or simply hear about the proliferation of bacteria, you have to
remember that in these, and in many other phenomena, the number e is present
in one way or another.
Appendices


The first thousand decimal places of e


The number e is approximately:


2.
7182818284 5904523536 0287471352 6624977572 4709369995
9574966967 6277240766 3035354759 4571382178 5251664274
2746639193 2003059921 8174135966 2904357290 0334295260
5956307381 3232862794 3490763233 8298807531 9525101901
1573834187 9307021540 8914993488 4167509244 7614606680
8226480016 8477411853 7423454424 3710753907 7744992069
5517027618 3860626133 1384583000 7520449338 2656029760
6737113200 7093287091 2744374704 7230696977 2093101416
9283681902 5515108657 4637721112 5238978442 5056953696
7707854499 6996794686 4454905987 9316368892 3009879312
7736178215 4249992295 7635148220 8269895193 6680331825
2886939849 6465105820 9392398294 8879332036 2509443117
3012381970 6841614039 7019837679 3206832823 7646480429
5311802328 7825098194 5581530175 6717361332 0698112509
9618188159 3041690351 5988885193 4580727386 6738589422
8792284998 9208680582 5749279610 4841984443 6346324496
8487560233 6248270419 7862320900 2160990235 3043699418
4914631409 3431738143 6405462531 5209618369 0888707016
7683964243 7814059271 4563549061 3031072085 1038375051
0115747704 1718986106 8739696552 1267154688 9570350354
Other curious and interesting facts about the number e


There are a multitude of interesting and curious situations related to the number e.
We have mentioned many of them in the text, but because the number e is inexhaustible,
we propose here a list of other facts (approximately e??) related to the number e.


1. The series that charmed Asimov


In chapter 2, it was shown that the number 1/e is the result of a series, or a sum
formed by an infinite number of terms, expressed as follows:

$$\frac{1}{e} = \frac{1}{2!} - \frac{1}{3!} + \frac{1}{4!} - \frac{1}{5!} + \frac{1}{6!} - \frac{1}{7!} + \cdots$$


By the 1970s, this series had sparked the interest of the writer Isaac Asimov
(1920-1992), a famous author of numerous novels and science fiction short stories, as well
as a large number of articles and books of popular science.

Asimov, a biochemist and occasional amateur mathematician (“I love mathematics,
but it is completely indifferent to me”, he wrote in one of his works), noted
that in the series written above, the following transformation can be made:

$$\frac{1}{e} = \frac{1}{2!} - \left(\frac{1}{3!} - \frac{1}{4!}\right) - \left(\frac{1}{5!} - \frac{1}{6!}\right) - \left(\frac{1}{7!} - \frac{1}{8!}\right) - \cdots$$


The writer, of Russian origin, also observed the following:

$$\frac{1}{3!} - \frac{1}{4!} = \frac{3}{4!}, \qquad \frac{1}{5!} - \frac{1}{6!} = \frac{5}{6!}, \qquad \frac{1}{7!} - \frac{1}{8!} = \frac{7}{8!},$$

and so on. By replacing each set of parentheses with the corresponding result, the
series can be rewritten as follows:

$$\frac{1}{e} = \frac{1}{2!} - \frac{3}{4!} - \frac{5}{6!} - \frac{7}{8!} - \cdots$$

Definitely, a series whose beauty deserves to be highlighted. Within it all terms, with the
sole exception of the first, have the same sign (negative, in this case), and, as we can
see, each of the natural numbers appears consecutively and only once.

When Asimov presented this small but elegant discovery, he said “until someone
stops me, I will call it the Asimov series”. To this day, we do not know of anyone
who stopped him.
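
A short numerical sketch can confirm that the regrouping does not change the value of the sum. It assumes the rewritten series has the form 1/e = 1/2! - 3/4! - 5/6! - 7/8! - ..., which is the form consistent with the description above (every term after the first negative, the natural numbers appearing in order); the function names are hypothetical.

```python
import math

def alternating_series(terms):
    """Partial sum of 1/2! - 1/3! + 1/4! - 1/5! + ... ."""
    return sum((-1) ** n / math.factorial(n) for n in range(2, 2 + terms))

def asimov_series(terms):
    """Partial sum of 1/2! - 3/4! - 5/6! - 7/8! - ... ."""
    total = 1 / math.factorial(2)
    for j in range(1, terms):
        odd = 2 * j + 1                      # 3, 5, 7, ...
        total -= odd / math.factorial(odd + 1)
    return total

print(alternating_series(24))
print(asimov_series(12))
print(1 / math.e)
```

Because each grouped term combines two terms of the original series, the regrouped version reaches the same accuracy with half as many terms.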


2. Mnemonic tricks for fans of the number e 


We saw in chapter 4 that the number e is irrational and, therefore, has an infinite
number of non-periodic decimal digits. Moreover, these digits do not follow any
regular or recognisable pattern. An immediate consequence of all this is that it is
truly difficult to commit to memory a large number of decimal places of e (as with
π, $\sqrt{2}$ or other irrational numbers).

Now, for those who enjoy surprising their friends with their great power
of retention, mnemonic rules have been invented to help remember the first
digits of the decimal expression of e. An example? Memorise the following
paragraph:


If finding a sequence of numerals a struggle to memorise don’t worry reminisce.
With music my it’s tough try baking. An exciting helpful song singing a fun
rhyme is amusing.


If we look at the number of letters in each of the 30 words in the paragraph,
we get a sequence that starts with 2 7 1 8 2 8... We must also keep in
mind that each full stop (except the last one) stands for a zero. These digits are
the first digits of e. Thus, if we memorise all the words in their exact order, we
can easily recite the first digits of the number e, whose written decimal form
begins as follows:


2.7182818284590452353602874713527...
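
The rule can be checked mechanically. The sketch below counts the letters of each word in the mnemonic (ignoring apostrophes) and inserts a zero for every full stop except the last; the function name mnemonic_digits is a hypothetical helper introduced for this example.

```python
MNEMONIC = (
    "If finding a sequence of numerals a struggle to memorise don't worry reminisce. "
    "With music my it's tough try baking. "
    "An exciting helpful song singing a fun rhyme is amusing."
)

def mnemonic_digits(text):
    """Turn the mnemonic into digits: one digit per word, one zero per inner full stop."""
    digits = []
    sentences = [s for s in text.split(".") if s.strip()]
    for index, sentence in enumerate(sentences):
        for word in sentence.split():
            # Count letters only, so the apostrophe in "don't" is ignored.
            digits.append(str(sum(ch.isalpha() for ch in word)))
        if index < len(sentences) - 1:
            digits.append("0")  # every full stop except the last is a zero
    return "".join(digits)

digits = mnemonic_digits(MNEMONIC)
print(digits[0] + "." + digits[1:])
```

The last digit produced is the 7 of the rounded expansion quoted above.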
3. Almost, almost whole numbers


Euler’s famous formula, analysed in chapter 2, tells us that $e^{i\pi}$ is exactly equal to
$-1$. Sometimes it also happens, by simple coincidence, that by raising the number e
to a relatively simple power, a number very close to an integer (whole number) is
achieved (not exactly an integer, as in the case of $e^{i\pi}$, but ‘almost’).

An example of this is given by the number $e^3$, which differs from an integer
by a little over eight hundredths, given that $e^3 = 20.08553...$ But an even more
spectacular example is that of the number $e^{\pi\sqrt{163}}$ which, in a result popularly
associated with the Indian mathematician Srinivasa Ramanujan (1887-1920), is less
than a trillionth away from an integer. Indeed:

$$e^{\pi\sqrt{163}} \approx 262{,}537{,}412{,}640{,}768{,}743.99999999999925007\ldots$$
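
Ordinary floating-point numbers do not carry enough digits to see how close this number is to an integer, so the following sketch uses Python's decimal module with 60 significant digits. The value of π is typed in as a 50-decimal constant, which is an assumption of this example, and the helper name is hypothetical.

```python
from decimal import Decimal, getcontext

getcontext().prec = 60  # work with 60 significant digits

# pi to 50 decimal places, entered by hand for this example.
PI = Decimal("3.14159265358979323846264338327950288419716939937510")

e_cubed = Decimal(3).exp()                        # e^3
almost_integer = (PI * Decimal(163).sqrt()).exp() # e^(pi * sqrt(163))

def distance_to_nearest_integer(x):
    """Absolute difference between x and the integer closest to it."""
    return abs(x - x.to_integral_value())

print(e_cubed, distance_to_nearest_integer(e_cubed))
print(almost_integer, distance_to_nearest_integer(almost_integer))
```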


4. ‘Euler-style’ formulae 


In the previous point, we mentioned Euler’s famous formula, $e^{i\pi} + 1 = 0$, which for
many is the ‘most beautiful mathematical formula in history’.

There are other formulae, perhaps not so famous or elegant, but which, in a way, 
are similar to Euler’s. Formulae whose validity can be demonstrated by reasoning 
similar to that which, at the time, let the prolific mathematician prove his own for- 
mula. Below are two examples. 


The first of these formulae says that: 


Its elegance lies in that it encompasses, in addition to e, π and i, the no less
famous irrational $\sqrt{2}$. Similarly, the second formula says that:


se = 


e 


In this last equation, $\sqrt{-2}$ should be understood as the complex number $\sqrt{2}\,i$.
5. Two squared, three cubed...


Let’s imagine that we are endeavouring to raise a positive number to the power
indicated by that same number. In this way, we can calculate, for example, $2^2 = 4$,
$3^3 = 27$, or $4^4 = 256$. Let us also imagine that we do not limit ourselves to only working
with integers, but allow any kind of positive number, whether rational or irrational;
this means that we can also calculate

$$\left(\frac{1}{2}\right)^{\frac{1}{2}} = \sqrt{\frac{1}{2}} \quad \text{or} \quad \left(\frac{1}{3}\right)^{\frac{1}{3}} = \sqrt[3]{\frac{1}{3}}.$$

It is evident that if we calculate $2^2$, $3^3$, $4^4$, $5^5$, ... we will obtain numbers that are
getting bigger, numbers that we could say ‘tend towards infinity’. But the
interesting question is: what is the power for which the result is the smallest of all? In
other words, what number x should we choose if we want $x^x$ to give us the smallest
possible result?

The answer, curiously, is that the lowest result is obtained by taking $x = \frac{1}{e}$.
That is, of all the powers of the form $x^x$, the smallest is

$$\left(\frac{1}{e}\right)^{\frac{1}{e}},$$

which approximately amounts to 0.6922...
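
This minimum can be located numerically without any calculus. The sketch below uses a simple ternary search, a technique chosen for this illustration and not taken from the book, which repeatedly discards a third of the interval; it relies on x^x first decreasing and then increasing for positive x. The search interval [0.01, 2] is an assumption.

```python
import math

def x_to_the_x(x):
    """The function x^x for positive x."""
    return x ** x

def ternary_search_minimum(f, lo, hi, iterations=200):
    """Locate the minimum of a function that decreases and then increases on [lo, hi]."""
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if f(m1) < f(m2):
            hi = m2   # the minimum cannot lie to the right of m2
        else:
            lo = m1   # the minimum cannot lie to the left of m1
    return (lo + hi) / 2

x_min = ternary_search_minimum(x_to_the_x, 0.01, 2.0)
print(x_min, 1 / math.e)
print(x_to_the_x(x_min))
```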


6. An infinite product of the roots of e 


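It can be shown that the series $1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \frac{1}{5} + \frac{1}{6} + \frac{1}{7} + \cdots$, known as the harmonic
series, does not have a finite sum, which can be expressed as follows:

$$1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \frac{1}{5} + \frac{1}{6} + \frac{1}{7} + \cdots = \infty.$$

But the series $1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \frac{1}{5} - \frac{1}{6} + \frac{1}{7} - \cdots$, known as the alternating harmonic
series (because it alternates positive and negative signs), does have a finite result:

$$1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \frac{1}{5} - \frac{1}{6} + \frac{1}{7} - \cdots = \ln 2.$$

In it, ln 2 (the natural logarithm of 2) is, by definition, the power to which the
number e must be raised for the result to be 2. From the previous equation, therefore,

$$e^{1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \frac{1}{5} - \cdots} = 2,$$

that is,

$$e^{1} \cdot e^{-\frac{1}{2}} \cdot e^{\frac{1}{3}} \cdot e^{-\frac{1}{4}} \cdot e^{\frac{1}{5}} \cdots = 2,$$

from which we obtain the following, indeed very curious, infinite product of ‘roots
of e’:

$$e \cdot \frac{1}{\sqrt{e}} \cdot \sqrt[3]{e} \cdot \frac{1}{\sqrt[4]{e}} \cdot \sqrt[5]{e} \cdots = 2.$$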


This expression can also be written as follows: 


7. Irrational raised to irrational 


We have already said that the number e is irrational, as are all powers of e with an
integer exponent (provided it is not 0). We have, for example, that the numbers $e^2$,
$e^3$, $e^{-1}$ and $e^{-2}$ are all irrational. Also irrational are all powers of e whose exponent is
a square root, a cube root or a root of any other order, for example, the numbers
$e^{\sqrt{2}}$, $e^{\sqrt[3]{2}}$, $e^{-\sqrt{2}}$ and $e^{\sqrt[5]{2}}$.

However, to this day, it is not known whether the number $e^e$ (‘e raised to the
power e’) is irrational or not. Of course, no serious mathematician believes that $e^e$
is rational. But its irrationality has not yet been demonstrated with the rigour
required by mathematical science.

Now, a question we might ask is whether it is possible, a priori, that an irrational
number raised to an irrational power results in a rational number. The answer is
that, in principle, it is possible. To show this, we shall develop an example of two
numbers, A and B, both irrational, such that $A^B$ gives us a rational number.

Let’s start by considering the number $\sqrt{2}^{\sqrt{2}}$. In fact, this number may be rational
or irrational. But the interesting aspect of this example is that, in order to develop
it, it is not necessary to know if $\sqrt{2}^{\sqrt{2}}$ is rational or not. In either case, we will find two
irrational numbers A and B such that $A^B$ is rational.

Let us first suppose that $\sqrt{2}^{\sqrt{2}}$ is rational. In this case, it suffices to take
$A = \sqrt{2}$ and $B = \sqrt{2}$ and, in effect, we obtain two irrational numbers A and B such that
$A^B$ is a rational number.

But what do we do if, in reality, $\sqrt{2}^{\sqrt{2}}$ is irrational? In this case, we will take
$A = \sqrt{2}^{\sqrt{2}}$ and $B = \sqrt{2}$, and so:

$$A^B = \left(\sqrt{2}^{\sqrt{2}}\right)^{\sqrt{2}} = \sqrt{2}^{\sqrt{2} \cdot \sqrt{2}} = \left(\sqrt{2}\right)^2 = 2.$$

Therefore, A and B are irrational, whereas $A^B$ is rational.

In short, it does not matter if $\sqrt{2}^{\sqrt{2}}$ is or is not rational, because in either case
two irrational numbers A and B are obtained, such that $A^B$ is rational.
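
The final calculation of the argument can be illustrated with floating-point arithmetic. This is only a numerical sanity check of the identity (√2^√2)^√2 = 2: floating-point numbers are all rational approximations, so the code cannot say anything about whether √2^√2 itself is rational.

```python
import math

root2 = math.sqrt(2)

A = root2 ** root2   # sqrt(2) raised to sqrt(2), roughly 1.63
B = root2
result = A ** B      # (sqrt(2)^sqrt(2))^sqrt(2), which should be 2 up to rounding

print(A, result)
```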


8. ‘A raised to the power B’


In the previous point, we talked about $\sqrt{2}^{\sqrt{2}}$, the ‘square root of 2 raised to the
power square root of 2’, but how is that number defined? The answer is by no
means obvious. In fact, $A^2 = A \cdot A$ (A multiplied by itself twice), $A^3 = A \cdot A \cdot A$ (A
multiplied by itself three times), $A^4 = A \cdot A \cdot A \cdot A$ (A multiplied by itself four times)
and so on. But this idea cannot be extended to define $A^{\sqrt{2}}$. We cannot say that
‘$A^{\sqrt{2}}$ is the result of multiplying A by itself $\sqrt{2}$ times’, as the phrase ‘$\sqrt{2}$ times’ does
not make sense.

There are many ways to define $A^B$, where A and B are any numbers (in reality,
A must be positive), but one of the most popular of these involves the number e.
The definition states that:

$$A^B = e^{B \ln A}.$$

If we take into account, as we saw in chapter 2, that:

$$e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \frac{x^5}{5!} + \frac{x^6}{6!} + \frac{x^7}{7!} + \cdots$$

Replacing x by B ln A, we deduce that:

$$A^B = 1 + B \ln A + \frac{(B \ln A)^2}{2!} + \frac{(B \ln A)^3}{3!} + \frac{(B \ln A)^4}{4!} + \frac{(B \ln A)^5}{5!} + \cdots$$

Therefore, taking $A = \sqrt{2}$ and $B = \sqrt{2}$:

$$\sqrt{2}^{\sqrt{2}} = 1 + \sqrt{2} \ln \sqrt{2} + \frac{(\sqrt{2} \ln \sqrt{2})^2}{2!} + \frac{(\sqrt{2} \ln \sqrt{2})^3}{3!} + \frac{(\sqrt{2} \ln \sqrt{2})^4}{4!} + \frac{(\sqrt{2} \ln \sqrt{2})^5}{5!} + \cdots$$
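
The series for a power can be tried out directly. The sketch below sums the first terms of 1 + B ln A + (B ln A)²/2! + ... and compares the result with Python's built-in power operator; the function name power_via_series and the choice of 30 terms are assumptions made for this illustration.

```python
import math

def power_via_series(A, B, terms=30):
    """Approximate A^B = e^(B ln A) by summing (B ln A)^n / n! for n = 0, 1, 2, ..."""
    if A <= 0:
        raise ValueError("A must be positive")
    x = B * math.log(A)
    total, term = 0.0, 1.0        # term holds x^n / n!, starting with n = 0
    for n in range(terms):
        total += term
        term *= x / (n + 1)       # turn x^n / n! into x^(n+1) / (n+1)!
    return total

root2 = math.sqrt(2)
print(power_via_series(root2, root2), root2 ** root2)
print(power_via_series(2.0, 3.0), 2.0 ** 3.0)
```

For whole-number exponents the series gives the same result as repeated multiplication, which is why this definition extends the familiar one.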


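9. Other infinite sums


As we recall from the previous point, we know that:

$$e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \frac{x^5}{5!} + \frac{x^6}{6!} + \frac{x^7}{7!} + \cdots$$

On the other hand, we know that $e^{\ln 2} = 2$. If we combine the two equations, we
obtain the following, curious, infinite sum:

$$2 = 1 + \ln 2 + \frac{(\ln 2)^2}{2!} + \frac{(\ln 2)^3}{3!} + \frac{(\ln 2)^4}{4!} + \frac{(\ln 2)^5}{5!} + \frac{(\ln 2)^6}{6!} + \frac{(\ln 2)^7}{7!} + \cdots$$

Or also:

$$1 = \ln 2 + \frac{(\ln 2)^2}{2!} + \frac{(\ln 2)^3}{3!} + \frac{(\ln 2)^4}{4!} + \frac{(\ln 2)^5}{5!} + \frac{(\ln 2)^6}{6!} + \frac{(\ln 2)^7}{7!} + \cdots$$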
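
Both sums can be checked with a few lines of code. The sketch below adds the terms (ln 2)^n / n!, starting either from n = 0 (which should give 2) or from n = 1 (which should give 1); the function name and the cut-off of 25 terms are assumptions of this example.

```python
import math

def ln2_power_sum(terms, start=0):
    """Sum of (ln 2)^n / n! for n = start, start + 1, ..., start + terms - 1."""
    x = math.log(2)
    return sum(x ** n / math.factorial(n) for n in range(start, start + terms))

print(ln2_power_sum(25))           # 1 + ln 2 + (ln 2)^2/2! + ...
print(ln2_power_sum(25, start=1))  # ln 2 + (ln 2)^2/2! + ...
```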


10. Transcendental, more transcendental, equal to... 


In chapter 4, we spoke of algebraic and transcendental numbers. We remember that
$\sqrt{2}$ and $\sqrt[3]{7}$ are algebraic, whilst e, π and ln 2 are transcendental.

It can be shown that adding two algebraic numbers always gives
another algebraic number. For example, $\sqrt{2} + \sqrt[3]{7}$ is algebraic. It also happens that
if we add an algebraic number and a transcendental number, the result is always a
transcendental number; for example, $e + \sqrt{2}$ is transcendental.

However, the sum of two transcendental numbers can be either
a transcendental number or an algebraic number. For example, $\sqrt{2} - e$ is
transcendental and so is e, but their sum $(\sqrt{2} - e) + e = \sqrt{2}$ is an algebraic number.
On the other hand, $e^2 + e^3$, the sum of two transcendental numbers, is a transcendental
number.

But what happens to the sum of the two ‘most famous transcendental numbers’,
e and π? The truth is that it is not yet known whether e + π is transcendental
or not. It is not even known whether it is rational or not. All mathematicians
assume that e + π is irrational and transcendental, but this has yet to be
mathematically proven.


11. Squaring of the hyperbola 


In chapter 4, we saw that the problem of squaring the circle is unsolvable. We recall
that, given any circle, this problem concerns constructing, using only a non-graduated
ruler and a compass, a square that has the same area as the circle. When we
say that this problem is unsolvable, what we are saying is that the construction is
impossible to carry out.

But the circle, as we shall see below, is not the only figure that cannot be
squared. Consider the following image:


The curve is the hyperbola of the equation y = 1/x, and as we saw in chapter
1, the area of the shaded region is ln 2. As this number is transcendental, it is not
possible to construct with a ruler and compass a square with the same area. In other
words, with these instruments, the shaded area in the image cannot be squared. On
the other hand, if we extend the base of the drawing until it reaches the number
e, instead of 2, the area of the figure will then be ln e = 1 and the image could be
squared... However, the base segment of the drawing, of length e - 1, could not be
drawn with a ruler and compass.
12. Mathematical language in science fiction


Science fiction usually assumes, probably rightly, that if, in the future, we come
into contact with an extra-terrestrial civilisation, mathematics could be used to
communicate with it. There are countless science fiction novels or films in which
prime numbers, the digits of π or theorems of elementary geometry, for example, are used
as a ‘lingua franca’.

In the novel The Mote in God’s Eye, by the American writers Larry Niven
and Jerry Pournelle, humanity encounters a technologically advanced alien
civilisation for the first time. In their first radio contact, the humans send
the first decimal digits of π and the aliens respond with the first digits of e, but
written in base 12, which they use because on their three hands, they have 12
fingers in total.


13. In other bases 


The first hundred digits of e in base 10 are: 


2.718281828459045235360287471352662497757247093699959574966967
627724076630353547594571382178525166427.


The first hundred digits of e in binary are: 


10.10110111111000010101000101100010100010101110110100101010011 
010101011111101110001010110001000000010. 


The first hundred digits of e in base 3 are: 


2.201101121221102011012222102011021222201202222210212212020112 
11222111000120222211210210201002202022. 


The first hundred digits of e in base 16 (or hexadecimal system) are:


2.B7E151628AED2A6ABF7158809CF4F3C762E7160F38B4DA56A784D9045190
CFEF324E7738926CFBE5F4BF8D8D8C31D763DA.
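
Expansions like these can be reproduced by repeatedly multiplying the fractional part by the base and reading off the whole part each time. The sketch below is an illustration under stated assumptions: it starts from the first 50 decimal places of e, typed in as a string, and uses exact fractions so that no rounding creeps in. The function name expand_in_base is hypothetical, and the letters A and B stand for ten and eleven in base 12.

```python
from fractions import Fraction

# First 50 decimal places of e, entered by hand for this example.
E_DECIMAL = "2.71828182845904523536028747135266249775724709369995"
DIGIT_SYMBOLS = "0123456789ABCDEF"

def expand_in_base(decimal_string, base, places):
    """Write a decimal number in another base, truncated to `places` fractional digits."""
    if not 2 <= base <= len(DIGIT_SYMBOLS):
        raise ValueError("base must be between 2 and 16")
    value = Fraction(decimal_string)   # exact value of the truncated decimal
    whole = int(value)
    fraction = value - whole
    # Convert the whole part by repeated division.
    head = ""
    while True:
        head = DIGIT_SYMBOLS[whole % base] + head
        whole //= base
        if whole == 0:
            break
    # Convert the fractional part by repeated multiplication.
    tail = []
    for _ in range(places):
        fraction *= base
        digit = int(fraction)
        tail.append(DIGIT_SYMBOLS[digit])
        fraction -= digit
    return head + "." + "".join(tail)

for base in (2, 3, 12, 16):
    print(base, expand_in_base(E_DECIMAL, base, 30))
```

Since only 50 decimal places are supplied, only the leading digits in each base can be trusted; a longer decimal string would be needed to reproduce all the digits listed above.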